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Description 

Field of the Invention 

5 The present invention relates to genes encoding proteins having aromatic acyl group transfer activity and the use 

thereof. More particularly, the present invention relates to genes encoding proteins having aromatic acyl group transfer 
activity derived from gentians ( Gentiana trrflora var. japonica), petunias (Petunia hybrids), perillas (Perilla pcimpides). 
and cinerarias (Senecio cruentus) . and the use thereof. 

10 Background Art 

The flower industry is making efforts to develop new and various varieties. An effective method of producing a new 
variety involves changing the color of a flower, for which the traditional breeding methods have been successfully 
employed to produce a wide variety of colors for almost all commercial varieties. With the above methods, however, it 
15 is rare that a single species produces colored varieties coming in a wide range of different colors since a pool of genes 
is limited for each species. 

The colors of flowers are based mainly on two types of pigments, flavonoids and carotenoids. Flavonoids contribute 
mainly to the colors in the range of yellow to red and blue, while carotenoids contribute to the color tones of orange or 
yellow. Flavonoid molecules that make a major contribution to the color of flowers are anthocyans which are glycosides 

20 of cyanidin, detphinidin, petunidin, peonidin, malvidin, and pelargonidin. Different anthocyans impart marked changes 
in the color of flowers. Furthermore, the color of flowers is affected by copigmentation with colorless flavonoids, metal 
complex formation, glycosylation, acylation, methylation and pH of vacuoles (Forkman, Plant Breeding 106: 1 , 1991). 

There are a number of reports of acylated anthocyans isolated from nature including cinerarin derived from ciner- 
arias (Senecio cruentus) (Goto et al.. Tetrahedron 25: 6021, 1984), awobanin derived from dayflowers (Commelina 

25 communis) (Goto and Kondo, Angew. Chem. Int. Ed. Engl. 30: 17, 1991) and gentiodelphin derived from Gentiana 
Makinoi (Yoshida et al., etrahedron 48: 4313. 1992) (Monarda didyma: Kondo et al.. Tetrahedron 26: 5879, 1985; peril- 
las, pansies (Goto et al., Tetrahedron 27:2413. 1987; Wandering Jew: Idaka et al.. Tetrahedron 28: 1901 , 1987; Diosco- 
rea japonica : Shoyama et al.. Phytochemistry 29: 2999, 1990; red cabbage, Platvcodon grand if lorum . lobelia, 
delphiniums, butterfly peas: Goto and Kondo, Angew. Chem. Int. Ed. Engl. 30:17. 1991 ; carrots: Glabgen et al., Phyto- 

30 chemistry 31 : 1593, 1992; morning glory: Lu et al., Phytochemistry 32: 659, 1992; Sarto et al., Phytochemistry 40: 1283, 
1995; Aiuoa decumbens. Clinopodium oracile. Lamiums. lavender, catnip, Leonurus macranthus. Plectranthus. Prunel- 
las. Salvias splendens Sella. Janapnese Artichoke: Sarto and Harborne. Phytochemistry 31: 3009. 1992; giant water 
lily: Strack et a).. Phytochemistry 31: 989. 1992; bellflowers: Brandt et al., 33: 209. 1993; gentians: Hosokawa et al., 
Phytochemistry 40: 941. 1995; hyacinth: Hosokawa et a).. Phytochemistry 40: 567, 1995). 

35 Acyl groups which modify these anthocyan-containing flavonoids are divided into two classes based on their struc- 
ture: one is the aromatic acyl groups centering on hydroxy cinnamic acids, and the other is the aliphatic acyl groups 
such as the malonyl group. It has been observed in the experiment carried out using the anthocyanin pigment of morn- 
ing glories (Pharbitis nil) that among the acyl groups transfer reactions anthocyans to which an aromatic acyl group, 
preferably coumaric acid or caffeic acid, is bound show a shift of the absorption maximum to the long wavelength side 

40 (Dangle et al.. Phytochemistry 34: 1 1 19. 1993). 

Furthermore, tor cinerarin derived from cineraria (Senecio cruentus) which has one aliphatic acyl group and three 
aromatic acyl groups, h has been reported that the stability of the pigment decreases in a neutral aqueous solution by 
removing aromatic acyl groups (Goto et al.. Tetrahedron 25: 6021, 1984). For gentiodelphin derived from gentians 
( Gentiana makinoi) also, it has been reported that an intra-molecular stacking of the sandwich type occurs due to the 

45 presence of two aromatic acyl groups in the molecule, which results in stabilization of the pigment in an aqueous solu- 
tion (Yoshida et al.. Tetrahedron 48: 4313. 1992). Moreover, Yoshida et al. have demonstrated that each of glucose at 
position 5 and glucose at position 3' of anthocyanin has an acyl group bound thereto (Tetrahedron 48: 4313, 1992). It 
has also been reported that anthocyanin in the leaves of perillas (Perilla ocimoides) is shisonin in which coumaric acid 
is bound to glucose at position 3 of cyanidin 3,5-diglucoside (Tetrahedron Letters 27: 2413-2416. 1978). 

so However, these studies have been carried out from the aspect of organic chemistry such as structural studies of 
natural pigments and not from the aspect of biochemistry such as efforts to isolate enzymes which transfer acyl groups. 

Of the transferases which transfer acyl groups to anthocyanin pigments, there are many reports on the malonyl 
group transferases which transfer an aliphatic acyl. including those from a cell culture of parsley (Matern et al.. Arch. 
Biochem. Biophys. 208: 233, 1981; Matern et al., Arch. Biochem. Biophys. 226: 206, 1983; Matern et al., Eur. J. Bio- 

55 chem. 133: 439, 1983), seedlings of Cicer arientium (Koster et al., Arch. Biochem. Biophys. 234: 513, 1984), and the 
like. 

Aromatic acyt transfer reaction was first reported for Silene, a member of Caryophyllaceae (Kamsteeg et al.. Bio- 
chem. Physiol. Pflanzen 175: 403. 1980). and the activity of aromatic acyltransferase has similarly been found in the 
soluble enzyme fraction of Matthiote (Teusch et al.. Phytochemistry 26: 991 ,1986). 
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However, these reports have been limited to a mere demonstration of the presence of enzymatic activity, and nei- 
ther the corresponding enzyme proteins have been specified nor findings have been obtained on the primary structure 
of the enzymes much less the genes encoding them. For other aromatic acyl transferases as well no reports have elu- 
cidated the primary structure of proteins or genes. Furthermore, there are no reports of examples in which the acylating 

5 reactions of anthocyanin pigments were positively used to expand the range of colors of flowers and to grow them, or 
examples in which acylation was used in an attempt to stabilize anthocyanins. 

On the other hand, the biochemical pathway of synthesis of anthocyanins of Petunia hvbrida has been well studied 
(Wiering, H. and de Vlaming, P. Inheritance and biochemistry of pigments. Petunia, P49-65 (1984), Griesbach, R.J., 
asen, S. and Leonhardt, B.A.. Phytochemistry, 30: 1729-1731 , 1991). and the presence of anthocyanins which contain 

10 an acyl group is known. As the acyl group of anthocyanins of Petunia, coumaric acid or caffeic acid is known. One mol- 
ecule of coumaric acid or caffeic acid is bound to rutinoside at position 3 of anthocyanin, whose chemical structure, 
when the anthocyanidin is malvidin, has been assigned to 3-0-(6-0-(4-0-coumaroyl)-a-D-glucopyranosyl)-5-0-p-D- 
gluc opyranosyl-malvidin and 3-0-(6-0-(4-0-caffeoyl)-a-D-glucopyranosyl)-5-0-f>D-gluco pyranosyl-malvidin, respec- 
tively. However, there were no reports on anthocyanins having two acyl groups. 

15 

Disclosure of the Invention 

The present invention relates to genes encoding proteins having aromatic acyl group transfer activity and the use 
thereof. Thus, with regard to said use, there is described a method for controlling an acyl group transfer reaction to fla- 
20 vonoids. preferably anthocyanins, which provides a possibility of developing a wide range of flower colors for a single 
species. In particular, said method is considered to be useful for imparting bluish tints to the existing color of flowers, 
because the absorption maximum of anthocyanin shifts to the long wavelength direction by transfer of aromatic acyl 
groups. 

In order to realize the above technology, it is necessary to elucidate the identity of enzymes responsible for aro- 
25 matic acyl transfer reactions and to separate the cDNA which encodes said enzymes. Furthermore, by utilizing the 
homology of genes it is possible to separate the genes of other acyl group transfer enzymes. Moreover, production of 
stable anthocyanin pigments can be realized by acylation since acylation leads to increased stability of anthocyanins. 

The inventors have isolated an acyltransferase from petals of gentians and determined the primary structure 
thereof. Furthermore, using recombinant technology we have also isolated cDNA's of acyltransferases of gentians, 
30 petunia, perillas. and cinerarias, and determined the nucleotide sequences of the structural genes. Thus, the present 
invention provides DNA sequences encoding acyltransferases which are present in the petals of gentians, petunias, 
and cineraria, and leaves of perillas. Furthermore, the enzymes of the present invention can be used to change the 
colors of flowers by acylating the anthocyanin pigments and to increase stability of anthocyanins. 

as Specific Description 

Genes encoding acyltransferases may be obtained, for example, as follows. Thus, first an acyltransferase is puri- 
fied from petals of gentians. Prior to the present invention all attempts to purify aromatic acyltransferases have failed. 
The inventors of the present invention have succeeded in purifying said enzyme for the first time by employing various 
40 chromatographic methods, especially affinity chromatography using a resin (for example. Blue Sepharose (TM) resin, 
etc.) on which is immobilized, for example, Cibacron Blue 3GA. 

Then, the partial amino acid sequence of the acyltransferase is elucidated using the conventional method and a 
synthetic nucleotide corresponding said amino acid sequence is prepared. 

On the other hand, poly A + RNA is extracted from petals of the same gentian, from which double stranded cDNA 
45 is synthesized using the conventional method and a cDNA library is further produced. Using the above double stranded 
cDNA as the template a DNA fragment specific to the gene of acyltransferase is obtained by the PCR method using the 
synthetic DNA primers which were used for synthesis of said synthetic DNA and cDNA. Then, using this DNA fragment 
as a probe, the above mentioned cDNA library is screened to obtain positive clones. Plasmid DNA which is recovered 
from the clones are separated and their nucleotide sequences are determined. Then the amino acid sequence obtained 
so from analysis of the purified acyltransferase and the amino acid sequence of the acyltransferase deduced from the DNA 
nucleotide sequence are compared to confirm that the above positive clone is the desired cDNA clone. 

The inventors have also found petunia mutant (VM) a mutant strain of petunia var. Surfinia purple (VM) (Suntory 
Ltd.), in which the color of the flower has been changed from red purple to purple, and determined the structure of 
anthocyanins according to the method as described by, for example, Yoshida et al. (Yoshida et al., Tetrahedron 48: 
55 4313. 1992); 

As the DNA of the present invention there is mentioned DNA encoding the amino acid sequence as set forth in any 
of SEQ ID No. 1 to 6. However, it is known that proteins having modified amino acid sequences in which several amino 
acids have been added, removed and/or replaced with other amino acids have enzymatic activity similar to the original 
protein. Accordingly, genes encoding proteins which have modified amino acid sequences wherein one or more amino 
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acids have been added, removed and/or replaced with other amino acids and which retain aromatic acyl group transfer 
activity are encompassed in the present invention. 

The present invention also relates to genes encoding proteins which hybridize with the nucleotide sequence as set 
forth in any of SEQ ID No. 1 to 6 or a portion thereof, for example the portion encoding six or more amino acids of the 

s consensus region, under the condition of, for example 2 to 5 x SSC and 50°C, and which have acyl group transfer activ- 
ity. Furthermore, the optimum hybridization temperature depends on the nucleotide sequence and its length. Preferably 
the hybridization temperature becomes low, as the nucleotide sequence becomes short. For example, in the case of the 
nucleotide sequence (18 bases) encoding six amino acids, a temperature of 50°C or lower is preferred. The present 
invention also relates to genes encoding proteins having the amino acid sequence which has a homology of 15% or 

10 higher, preferably 25% or higher, for example 30% or higher with the amino acid sequence as set forth in any of SEQ 
ID No. 1 to 6, and which has aromatic acyl group transfer activity. 

The DNA which has the original nucleotide sequence is obtained, as specifically described in Examples, by screen- 
ing, for example a cDNA library. 

DNA encoding the enzyme having a modified amino acid sequence can be synthesized by the conventional site- 

75 directed mutagenesis or a PCR method based on the DNA having the original nucleotide sequence. For example, a 
DNA fragment having a site which is desired to be modified is obtained by digestion with restriction enzymes of cDNA 
or genomic DNA obtained as above, which is then used as the template to obtain the DNA fragment having the desired 
modification inserted therein by site-directed mutagenesis or a PCR method, and by ligating this to the DNA which 
encodes other parts of the desired enzyme. 

20 Alternatively, in order to obtain DNA encoding an enzyme having a shortened amino acid sequence, DNA encoding 
an amino acid sequence longer than the desired amino acid sequence, for example DNA encoding the full-length amino 
acid sequence is cut with the desired restriction enzyme. When the resulting DNA fragment does not encode the 
desired entire amino acid sequence, the missing portion can be complemented by ligating synthetic DNA. 

A gene encoding acyltransferase according to the present invention can be obtained by expressing the above clone 

25 in Escherichia coli and yeast using gene expression systems, confirming that the gene obtained encodes acyltrans- 
ferase, and elucidating the translation region of the gene of acyltransferase. Furthermore, by expressing said gene a 
genetic product, the protein of the desired acyltransferase can be obtained. 

Alternatively, it is also possible to obtain said protein using an antibody against the amino acid sequence described 
in any of SEQ ID No. 1 to 6. 

30 Thus, the present invention relates to a recombinant vector comprising said DNA, in particular an expression vec- 
tor, and a host transformed with said vector. As the host, a eukaryotic or prokaryotic organism may be employed. The 
prokaryotic organisms which may be used include a bacterium belonging to the genus Escherichia, for example. 
Escherichia coli a bacterium belonging to the genus Bacillus, for example Bacillus subtilis. or any other conventional 

35 The eukaryotic organisms which may be used include lower eukaryotes. for example eukaryotic microorganisms, 
for example fungi such as yeast or filamentous fungi. As the yeast, there are mentioned Saccharomvces such as Sac- 
charomvces cereviciae . and as the filamentous microorganisms, there are mentioned Aspergillus such as Aspergillus 
oryzae and Aspergillus nicer, and Penicillium. and the like. Moreover, animal cells or plant cells may be used. The ani- 
mal cells which may be used include cell lines of mouse, hamster, monkey, human, and the like. Furthermore, insect 

40 cells such as silkworm cells or larvae of silkworm themselves may be used as a host. 

The expression vectors of the present invention contain expression regulating regions, for example promoter and 
terminater, replication origin, and the like depending on the kind of the host to which they are introduced. As a promoter 
for bacterial expression vectors, conventionally used promoters such as trc promoter, tac promoter, lac promoter, etc. 
may be used. As a promoter for yeast for example, glyceraldehyde-3-phosphate dehydrogenase promoter, PH05 pro- 

45 moter, and the like may be used. As s promoter for filamentous organisms, for example, amylase, trp C. and the like 
may be used. As a promoter for animal cell hosts, viral promoters such as SV40 early promoter. SV40 late promoter, 
and the like may be used. 

. The construction of an expression vector may be carried out according to a conventional method using restriction 
enzymes, ligase. and the like. Transformation of hosts with an expression vector may also be carried out according to 
so a conventional method. 

In the manufacture of said proteins, the desired protein can be obtained by culturing, growing, or breeding a host 
transformed with the above-mentioned expression vector, and then subjecting the culture to gel-filtration, centrH ugation, 
cell disruption, gel-filtration chromatography, ion exchange chromatography, and the like to recover and/or purify said 
protein. 

55 Although the invention has been described with specific reference to acyltransferases derived from gentians, petu- 
nias, perillas, and cinerarias, it should be noted that the purification method of said enzyme may be wholly or partially 
modified to purify acyltransferases of other plants and then the amino acid sequences of said enzymes are determined 
in order to clone genes encoding said enzymes. By using as a probe cDNA of acyltransferase derived from a gentian 
according to the present invention, it was also possible to obtain cDNA of another acyltransferase from a gentian and 
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cDNA of another acyltransferase from a petunia. Accordingly, by using part or all of the gene of acyltransferase it is pos- 
sible to obtain the gene of another acyltransferase. Comparison of these amino acid sequences revealed the presence 
of a region of a conserved amino acid sequence. By using this region it was also possible to obtain cDNA of acyltrans- 
ferase of a perilla and a cineraria. A similar method can be applied to other plants to obtain cDNA or chromosomal DNA 

5 clone of a similar acyltransferase. 

As has been described hereinabove, by purifying acyltransferases derived from a gentian, a petunia, a perilla and 
a cineraria and then obtaining antibody against said enzyme according to a conventional method, it is possible to clone 
cDNA or chromosomal DNA which produces a protein capable of reacting with said antibody. Thus, the present inven- 
tion is not limited to acyltransferases derived from gentians, petunias, perillas and cinerarias, but relates broadly to aro- 

io matic acyltransferases. 

Furthermore, the present invention relates to plants of which colors have been controlled by introducing gene of 
acyltransferase thereinto, or progenies thereof or their tissues, and they may be in the form of cut flowers. 

Furthermore, in the present specification. CoA esters such as p-coumaroyl-CoA or caffeoyl-CoA etc. were men- 
tioned as a donor of an acyl group in the acyl group transfer reaction of flavonoids involving anthocyanins, further more 
is p-coumaroyl, ferulloyl, or hydroxycinnamoyl-1-O-glucose such as sinapoyl-1-O-glucose can also be used as a donor of 
an aromatic acyl group (Glassegen and Seitz, Planta 186: 582, 1992), and therefore enzymes according to the present 
invention can be used. 

Examples 

20 

The present invention is now explained with reference to the following specific embodiments. The experimental pro- 
cedures used were according to Molecular Cloning by Sambrook (Cold Spring Harbor Laboratory Press. 1989), unless 
otherwise specified. 

(1) Preparation of substrate 

Delphinidin 3,5-diglucoside and cyanidin 3,5-diglucoside were obtained from petals of Tapian violet (Suntory Ltd.), 
30 a breed of Verbena hvbrida. by extracting a diacetylated form of each of the above and then by deacetylating them. Pet- 
als (348 g) of Tapian violet were homogenized with liquid nitrogen in a homogenizer, immersed in 1.5 L of 50% (v/v) 
acetonitrile and 0.2% trifluoro acetic acid (TFA), and then allowed to stand for three days. 

The product thus obtained was filtered under aspiration through diatomaceous earth (#100) spread over filter paper 
then concentrated to half the volume in a rotary evaporator, followed by gel-filtration with HP-20 (Pharmacia). After 
35 washing with 800 ml of distilled water, the pigment fraction was eluted with 800 ml of 50% acetonitrile and 0.1% TFA. 
After concentration in an evaporator, it was lyophilized to obtain crude pigment (7.3 g). 

Since the main pigments in Tapian are 3,5-diacetylglucoside of delphinidin and cyanidin. the following procedure of 
deacetylation was carried out. One gram of the crude pigment was dissolved in 50 ml of methanol and aerated with 
nitrogen gas for 15 minutes to expel dissolved oxygen and then cooled on ice. 
40 Separately, dissolved oxygen was similarly expelled from 50 ml of 1 N sodium hydroxide, into which the above pig- 
ment solution was added dropwise while stirring in the ice, and was stirred for further 30 minutes to effect hydrolysis. 
One ml of 6 N hydrochloric acid was added thereto to stop the reaction. Then, 5 ml of distilled water was added and 
concentrated to half the volume in an evaporator, to which methanol was added to a final concentration of 1 0%; Two ml 
aliquots were applied to Sep Pac C1 8 column (Waters Association), which was then washed with 5 ml of distilled water. 
as and eluted with 2 ml of 30% acetonitrile and 0.6% TFA. 

All eluates were collected and concentrated in an evaporator, and then fractionated by HPLC. Using a DEVELOSIL 
ODS-10/20 (50 x 300 mm; Nomura Kagaku K.K.) column, elution was effected at a linear gradient of TFA from 0.1% to 
0.3% and acetonitrile from 10% to 30% over 120 minutes. Fractions were collected every 0.5 minute at a flow rate of 32 
ml per minute. Absorption spectrum of each pigment faction was measured to separate delphinidin-3,5-diglucoside and 
so cyanidin 3,5-diglucoside, which were then concentrated and lyophilized (de!phinidin-3,5-diglucoside, 75 mg and cyani- 
din 3,5-diglucoside, 50 mg). They were each dissolved in 0.5% TFA to a concentration of 1 .5 mg/ml and stored at -80°C 
until use. 

Synthesis of another substrate, hydroxy cinnamoyl-CoA was earned out in the following manner. First, an ester was 
synthesized from caffeic acid (Nakalai tesque) and N-hydroxysuccinimide (Merck) according to a literature (Stockigt 
55 and Zenk, Z. Naturforsch. 30: 352. 1975). This ester (0.5 mmol) was dissolved in 2 ml of acetone. Separately 0.1 mmol 
of Coenzyme A (CoA: KOHJIN) and 1 mmol of sodium hydrogen carbonate were dissolved in 20 ml of water, to which 
was added dropwise the ester solution prepared above. 

After the mixture was reacted overnight while stirring under nitrogen gas at room temperature, it was concentrated 
in a rotary evaporator and centrifuged (27,000 x g, 1 0 min) to remove insoluble matter and the desired product was col- 
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lected using HPLC. Using a DEVELOSIL ODS-10/20 (50 x 300 mm; Nomura Kagaku K.K.) column, elution was carried 
out at a linear gradient of acetonitrile from 18% to 36% in the presence of 0. 1% of TFA over 40 minutes. Fractions were 
collected every 0.8 minute at a flow rate of 32 ml per minute. The absorption spectrum of each fraction was measured 
(200 to 400 nm) to colled the fractions having an absorption maximum in the range of 344 to 348 nm as the caffeoyl 

5 CoA fraction. After concentration in a rotary evaporator, they were separated using the same column again. 

However, elution was carried out by isocratic chromatography of 18% acetonitrile and 0.1% TFA, and the absorp- 
tion spectrum was measured simultaneously to concentrate the fractions containing the desired compounds in a rotary 
evaporator, which were then lyophilized. This method produced 35 umol of the products. By substituting coumaric acid 
for caffeic acid above, p-coumaroyl-CoA was synthesized. The product was dissolved in distilled water at 2 mg/ml and 

w stored at -80° C until use. 

(2) Extraction method of the crude enzyme solution 

Three grams of the plant tissue (petals, edible parts, etc.) from which enzyme was to beextracted was frozen in 
is liquid nitrogen and was homogenized in a mortar, tt was further homogenized by adding 10 ml of the extraction buffer 
(100 mM phosphate buffer, pH 7.5, 10 mM sodium ascorbate, 14 mM 2-mercaptoethanol) and was filtered through 
three layers of gauze. After adding 3 g of DOWEX (1-X2, 100-200 mesh; Muromachi Kagaku Kogyo K.K.) and stirred 
for 10 minutes, the resin was removed by filtration under aspiration and the debris of the plant tissue was removed by 
centrrfugation (27,000 x g, 20 minutes). It was then subjected to salting out under 70% saturated ammonium sulfate to 
20 precipitate proteins. The precipitate was suspended into 1 ml of the solubilizing buffer (20 mM phosphate buffer. pH 7.5, 
14 mM 2-mercaptoethanol) and insoluble matter was removed by centrifugation (27,000 x g. 5 minutes). Then it was 
desalted using Sephadex G-25 column (NAP-10; Pharmacia) which had been equilibrated with the solubilizing buffer 
and the solution thus obtained was used as the crude enzyme solution. 

25 (3) Method of measuring enzyme activity 

Fifty |il of a reaction mixture containing 100 mM phosphate buffer, pH 8.5, 24 nmol of delphinidin 3,5-diglucoside. 
21.5 nmol of caffeoyl-CoA, and 20 nl of the enzyme solution was reacted at 30°C for 10 minutes. After stopping the 
reaction by adding 50 ui of acetonitrile containing 13.8% (v/v) acetic acid and insoluble matter was removed by centrrf- 
30 ugation (18.000 x g. 5 minutes), it was analyzed by high performance liquid chromatography (HPLC). Twenty pi of the 
reaction mixture was analyzed using a C1 8 reverse phase column (YMC-Pack ODS-A, 6.0 x 150 mm; YMC) and 21.6% 
acetonitrile and 0.1% trrfluoroacetic acid at a flow rate of 1 ml per minute. The compounds were detected using a three 
dimensional chromatography system (CLASS-LC10, Shimazu Seisakusho, K.K.) and it was found that the product has 
an absorption maximum at about 330 nm which is absent in the substrate and that the absorption maximum in the vis- 
as ible light range shifted by about 6 nm from 51 9 nm to 525 nm, confirming that an acyl group (caffeic acid) is bound, and 
delphinidin 3-glucosyl 5-caffeoyl glucoside has been produced. 

By detecting at a wavelength of 520 nm. the ratio of the peak area of the product (delphinidin 3-glucosyl 5-caffeoyl 
glucoside) to the sum of the peak areas of the substrate (delphinidin 3,5-diglucoside) and the product (delphinidin 3- 
glucosyl 5-caffeoyl glucoside) was determined to calculate the mole number of the product, which was defined as the 
40 enzymatic activity (kat). The retention time for each compound in this HPLC analysis was as follows: caffeoyl-CoA. 6.3 
min; delphinidin 3,5-diglucoside, 3.3 min; delphinidin 3-glucosyl 5-caffeoylglucoside, 5.3 min. 

Since under this reaction condition delphinidin 3,5-diglucoside in the reaction mixture is modified with caffeic acid 
by the action of acyltransferase resulting in color change of the reaction mixture from dark blue to reddish purple, the 
enzymatic activity can be determined, as a simple method, by carrying out the reaction in a microliter plate. 
45 When the plate after the reaction is allowed to stand at room temperature for a prolonged period of time (one day 
to one week), delphinidin 3,5-diglucoside which was not acytated becomes colorless, whereas the delphinidin 3.5-diglu- 
coside which was acytated by the action of the enzyme retains the reddish purple color, so that stabilization of delphi- 
nidin 3,5-diglucoside in a neutral to alkaline solution was observed because of its acylation. Similarly, when cyan'dine 
3,5-diglucoside was used as the substrate, the cola of the reaction mixture changed from reddish purple to dark blue 
so and the pigment becomes stabilized, and hence it is possible to detect the enzymatic activity by a si mple enzyme assay. 
On the other hand, when caffeoyl-CoA isreplaced with p-coumaroyl-CoA, acylation-derived color change and sta- 
bilization of anthocyanin are observed, but the degree of change in the tone of color is smaller than with caffeoyl-CoA. 

(4) Search for acyltransferase 

55 

Crude enzyme solutions were extracted from a variety of plants including gentians, iris, delphiniums, stocks. Eus- 
toma russellianum Griseb. pinks, sweet peas. Larkspurs, pansy, cinerarias (petals for the above plants), red cabbages, 
red onions. Kintoki carrots, western carrots, purple potatoes, perillas (edible parts for the above plants) and egg plants 
(epithelial part of the fruit), and their enzymatic activities were determined. As a result, acyltransferase activities of 0.63. 
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0.001 2, and 21 .8 nkat/mg protein were detected in the extracts from Eustoma russellianum Griseb. pinks, and gentians, 
respectively. Gentian, which had the highest acyltransferase activity per protein extracted, was used as a material for 
enzyme purification. 

Determination of protein concentration was earned out using the Bio-Rad Protein Assay (Bio-Rad). 

5 

Example 2 . Purification of acyltransferase derived from gentians 

(1) Purification of enzyme 

10 Enzyme was extracted from petals of Gentiana trif lora var. japonica. The following experiment was carried out at 0 
to 4°C unless otherwise noted. Three kilograms of petals of Gentiana trKlora var. japonica was homogenized in the 
presence of liquid nitrogen using the Excell Auto Homogenizer (DX-3; Nihoh Seiki Seisakusho). After adding 8 L of the 
extraction buffer (100 mM phosphate buffer, pH 7.0. 10 mM sodium ascorbate, 10 nM p-amidinophenyl methanesulfonyl 
fluoride hydrochloride (p-APMSF; Wako Pure Chemicals K.K.)), 5 mM dithiothreitol (DTT; Nakalaitesk). and 500 g of 

is polyclar SB-1 00 (Wako Pure Chemicals K.K.), it was completely pulverized. 

After the pulverized liquid was squeezed with 4 layers of gauze, it was further centrifuged (1 1 ,000 x g. 30 min) to 
remove cell debris. Then it was salted out with 40% saturated ammonium sulfate and insoluble matter was removed 
prior to satting out again with 70% saturated ammonium sulfate. The precipitate was suspended into 250 ml of the sol- 
ubilizing buffer (20 mMTris-HCI, pH 7.0, 10 nM p-APMSF, 1 mM DTT). and insoluble matter was removed by centrifu- 

20 gation. Then it was desalted using Sephadex G-25 column (95 x 1 1 0 mm; Pharmacia) which had been equilibrated with 
the same buffer. The protein-containing fractions were collected (860 ml) and subjected to the following chromatogra- 
phy. 

Each of chromatographies of Q-Sepharose Fast Flow, HiTrap Blue and Phenyl Superose were carried out using the 
FPLC system (Pharmacia). 

25 First, the samples were applied to Q-Sepharose Fast Flow (26 x 1 00 mm; Pharmacia) which had been equilibrated 
with the same buffer. After adequately washing the column with the same buffer, it was eluted with a liner gradient of 
sodium chloride from 0 M to 0.4 M in 60 minutes (8 ml/min). After the fractions containing enzymatic activity were 
pooled (130 ml), they were subjected to affinity chromatography. It was then applied to three columns of HiTrap Blue (5 
ml, 1 6 x 25 mm; Pharmacia) connected in a series, adequately washed with the same buffer, and eluted with the same 

ao buffer containing 1 M sodium chloride. The active fractions were safted out with 70% saturated ammonium sulfate to 
obtain a protein precipitate. 

The precipitate was suspended in 1 ml of the solubilizing buffer and insoluble matter was removed by centrifuga- 
tion, and then was applied to Sephacryl S-200 (25 x 1150 mm; Pharmacia) which had been equilibrated with the solu- 
bilizing buffer. At a flow rate of 0.2 ml per minute about 3 ml fractions were collected and after the active fractions were 

35 collected again (27 ml), sodium ammonium was added thereinto to a concentration of 1 M. After fully stirring, it was cen- 
trifuged (39,000 x g, 1 0 min) to remove insoluble matter and applied to Phenyl Superose 5/5 (5.0 x 50 mm; Pharmacia) 
which had been equilibrated with the solubilizing buffer containing 1 M sodium ammonium. 

After adequately washing at a flow rate of 0.5 ml. the concentration of sodium ammonium was linearly decreased 
from 1 M to 0 M over 60 minutes to etute protein. A 0.5 ml aliquot of each fraction was measured for enzymatic activity. 

40 In an analysis by SDS-polyacrylamide gel electrophoresis a band of molecular weight about 50.000 was observed as 
an almost single protein and since correlation was observed between this protein and activity, the protein was con- 
cluded to be the desired acyltransferase. The fractions (1 2 ml) having activity were further purified by reverse phase 
HPLC in order to obtain a single product. 

Using a DEVELOSIL 300 C4-HG-5 (4.6 x 250 mm; Nomura Kagaku K.K.) column, elution was carried out at a linear 

45 gradient of acetonitrile from 40.5% to 56.7% in the presence of 0. 1 % trif luoroacetic acid over 30 minutes at a flow rate 
of 1 ml per minute. One ml fractions were collected while monitoring absorbance at 280 nm. Each fraction was further 
analyzed by SDS-polyacrylamide gel electrophoresis to collect fractions containing protein of molecular weight about 
50,000. By repeating this HPLC for 30 times and concentrating in a speed Vac Concentrator (Savant), about 0.2 ml of 
single protein product was obtained. 

so 

(2) Analysis of purified protein 

When 500 pmol of purified product was subjected to the amino acid sequencer (PSQ-1; Shimazu Seisakusho 
K.K.), 200 pmol of glutamic acid at the first stage of Edman degradation and 90 pmol of glutamic acid at the second 
55 stage were detected, but not at the third stage and thereafter. Accordingly, it was inferred that the N-terminal of the 
enzyme was blocked in some way or other. 

However, since it is known that when the N-terminal is glutamic acid, pyroglutamic acid is formed and the sequence 
as described above by Edman degradation is observed, it is highly probable that the N-terminal of the enzyme is 
glutamic acid. 
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The remainder of the precipitate was dissolved in a solution containing 80 ul of 45 mM Tris-HCI. pH 8.5, 3.6 M urea, 
and 0.09% SDS, to which was added 16 pmol of lysyl endopeptidase (Lysyl Endopeptidase: derived from Achromobac- 
tor lyticus; Wako Pure Chemicals K.K.) and was reacted at 37°C for 6 hours. The reaction mixture was separated 
directly by a DEVELOSIL 300 C4-HG-5 column. 
5 The separation condition was a flow rate of 0.7 ml of a linear gradient of acetonitrile from 0% to 80% over 70 min- 

utes in the presence of 0.1% trifluoroacetic acid. By monitoring absorbance at 210 nm fragments having a peak absorb- 
ance were collected. Out of 13 peak fractions thus obtained, three fractions which eluted at acetonitrile concentrations 
of 32% to 40% were concentrated in the Speed Vac Concentrator and then separated and purified using an ODS col- 
umn (DEVELOSIL 300 ODS-HG-5; Nomura Kagaku KKf} under the same condition as above. ~ 
10 Each fraction was concentrated to dryness in the Speed Vac Concentrator, dissolved in 30 ul of 40% acetonitrile. 
and then subjected to the amino acid sequencer. As a result, the amino acid sequences of six peptides were able to be 
analyzed. The amino acid sequence for each peptide is shown below (the sequence is shown in the direction from the 
amino terminus to the carboxy terminus): 

Amino acid sequence (AT73): Arg-Phe-Leu-Gly-lle-Thr-Gly-Ser-Pro-Lys (SEQ ID No. 7) 
15 Amino acid sequence (AT72): lle-His-Met-Asp-Ala-Phe-Ala-Lys (SEQ ID No. 8) 

Amino acid sequence (AT741-1): Gly-Val-Glu-lle-Gly-Val-Ser-Leu-Pro-Lys (SEQ ID No. 9) 

Amino acid sequence (AT741-2): Ala-Ser-Leu-Ser-Leu-Thr-Leu-Lys (SEQ ID No. 10) 

Amino acid sequence (AT9): His-Tyr-Val-Pro-Leu-Ser-Gly-Asn-Leu-Leu-Met-Pro-lle-Lys (SEQ ID No. 1 1) 

Amino acid sequence (AT83): Val-Arg-Ala-Thr-Tyr-Val-Leu-Ser-Leu-Ala-Glu-lle-Gln-Lys (SEQ ID No. 12) 

20 

Example 3 . cDNA cloning of acvHransterase derived from gentians m 
(1) Construction of cDNA library 

25 Petals were collected from commercial gentians (Gentiana triflora var. japonica) and homogenized under liquid 
nitrogen in a mortar. From the homogenate. RNA was obtained by the method utilizing guanidine thiocyanate/cecium 
chloride and then poly A + RNA was obtained using the Oligotex (Nihon Roche) in the method recommended by the 
manufacturer. The method using guanidine thiocyanate/cecium chloride was carried out according to the method 
described in detail in R. McGookin, Robert J. Slater et al., Methods in Molecular Biology vol. 2 (Human Press Inc. 1 984). 

30 Using the obtained poly A + RNA as the template, double stranded cDNA was synthesized using the ZAP-cDNA 
synthesis kit (manufactured by Stratagene) and was cloned into phage vector XZAPII. Furthermore, using the Giga- 
packll Gold Packaging Extract kit of the same company cDNA library was constructed by the method described in the 
kit insert 

35 (2) Designing synthetic DNA primers 

Among the amino acid sequences obtained in Example 2, the sequence represented by lle-His-Met-Asp-Ala-Phe- 
Ala-Lys (SEQ ID No. 13) is very likely to be Lys-lle-His-Met-Asp-Ala-Phe-Ala-Lys (SEQ ID No. 14) considering the spe- 
cificity of lysyl endopeptidase. Using the portion represented by the amino acid sequence: Lys-lle-His-Met-Asp-Ala- 
40 Phe-Ala (SEQ ID No. 15) in this sequence, the following oligonucleotide was synthesized: 
Nucleotide sequence (Oligo 1): 5'-AARATHC AYATGGAYGC ITTYGC-3' (SEQ ID No. 16) 

Here, the sequence of nucleic acids is shown by the one-letter code in accordance with IUPAC-IBU. That is. A: ade- 
nine, C: cytosine, G: guanine, T: thymine, Y: C or T. R:A or G. H: A or C or T. and I: inosine. 

Furthermore, another oligonucleotide shown below was also synthesized based on the primer used for construc- 
ts tion of the cDNA library mentioned above: 

Nucleotide sequence (Oligo 2): 5--CTCGAGI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1-3' (SEQ ID No. 17) 

(3) Cloning of fragments of the acyttransferase gene 

so Using about 0.1 pg of double stranded cDNA derived from RNA of petals of gentians and Oligo 1 and Olig 2 as 
primers, the PCR reaction was carried out. The reaction was carried out using the polymerase chain reaction kit Gene 
Amp (Takara Shuzo K.K.) for 35 cycles with one cycle comprising 95°C for 1 minute, 45°C for 1 minute, and 72°C for 2 
minutes. When the reaction product thus obtained was run on a 1% agarose gel electrophoresis, a specific DNA frag- 
ment of about 400 bp was observed. This DNA fragment was recovered and 10 ng thereof was subjected to 25 cycles 

55 of the above-mentioned polymerase chain reaction using the DIG-nucleotide mixture (Boehringer) and synthetic nucle- 
otide I and II to obtain DIG-labelled DNA fragments. 
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(4) Cloning of cDNA of acyltransferase 

Ji phage library obtained as above was infected to E. SQii strain XL1 -Blue (Stratagene) to screen five plates (diam- 
eter, 13.5 cm) containing 50,000 plaques per plate. 
5 Phage was adsorbed to a filter (Hybond N+, Amersham) and treated in the method recommended by the manufac- 
turer, and then the filter was allowed to remain in the hybridization buffer (5 x SSC, 50% formamide, 50 mM sodium 
phosphate buffer, pH 7.0, 7% SDS, 2% Blocking reagent (Boehringer), 0.1% lauloyl sarcosine, 80 mg/ml salmon sperm 
DNA) at 42°C for 1 hour. The DIG-labelled DNA fragment obtained above was added to the hybridization solution and 
incubated for 16 hours. 

w The filter was washed with a washing solution (0.2 x SSC, 0.1% SDS) and then an enzymeimmunoassay (Boe- 
hriner Mannheim) using the DIG-specific antibody labelled with alkaline phosphatase was carried out to detect by color 
development using 5-bromo-4-chloro-3-indolyl phosphate and nitroblue tetrazolium salt. The detection method used 
was as set forth in the manufacturer's instructions. 

As a result, a few dozen positive clones were obtained. From 20 clones of them cDNA was collected on the plasmid 

is pBluescript SK-. Insertion of cDNA was examined by agarose gel electrophoresis and it was found that the insertion of 
cDNA's of varying sizes was observed in all clones and the longest among them was 1 .7 Kb. Among them 9 clones were 
chosen and were subjected to analysis by restriction enzymes. Accordingly, it was found that similar patterns of restric- 
tion enzyme cleavage were observed, though their sizes were varied. 

20 (5) Determination of nucleotide sequence 

Plasmid was extracted from the clones thus obtained. Using the AB1 373A DNA Sequencer (Perkin Elmer), for six 
clones (pGAT2, pGAT3. pGAT4, pGAT7, pGAT8, and pGAT11) out of nine which are considered to contain the full- 
length, the nucleotide sequence of the 5' end of cDNA was determined by the dideoxy sequence method using the f luor- 
25 ogenic reagents recommended by the same manufacturer. 

The result suggested that these clones have the same nucleotide sequence and differ in the length of cDNA. From 
among these clones, the entire nucleotide sequence of pGAT4 was determined. Determination of the nucleotide 
sequence was carried out for each clone after a series of deleted clones were obtained using the Deletion Kit for Kilo- 
Sequence (Takara Shuzo. K.K.). 

30 

(6) Comparison of the nucleotide sequence with the amino acid sequence 

cDNA which was inserted into pGAT4 represented 1 703 bases, which was found to contain an open reading frame 
(ORF) comprising 1410 bases (containing the stop codon). The sequence is shown in the sequence listing SEQ ID No. 
35 1 . Since all of the partial amino acid sequences of the acyltransferase revealed in Example 2 occurred as amino acid 
sequences in the ORF, it was concluded that the cloned cDNA was the gene of acyltransferase derived from gentians. 
Analysis of the amino terminal of the initiation codon suggested that glutamic acid is the residue of the amino terminal, 
so that it was inferred that the first ATG from the 5" end was the initiation codon on the nucleotide sequence of the cDNA. 
On the other hand, since the cDNA of pGAT8 is shorter than pGAT4 by 7 bases at the 5' end, it was suggested that 
40 this was not the full-length cDNA. 

Example 4 . Expression of oenes in E. coli 

(1) Construction of expression plasmid 

45 

pTrc99A (Pharmacia), an E. coli expression vector, was used for expression of the acyltransferase gene of E. col- 
This pTrc99A contains &. col trc promoter which can be induced by isopropyl-p-D-thiogalactopyranoside (IPTG) and 
therfore by inserting the desired gene downstream of said promoter the gene can be expressed in EL col 

A restriction enzyme Ncol site has been inserted thereinto by making use of the initiation codon, ATG sequence, 
so so that direct expression of the desired gene from the initiation codon is possible by recombining it with Ncol. 

pGAT10 was constructed by recombining the 1.8 kb DNA fragment (containing all nucleotide sequences as set 
forth in SEQ ID No. 1) obtained by digestion of pGAT4 with EcoRI and Kpnl which are present in the present vector with 
the EcoRI and Kpnl sites of the above-mentioned pTrc99A. 

In order to introduce a Ncol site in the vicinity of the initiation codon of the acyltransferase, the following two oligo- 
55 nucleotides were synthesized which correspond to the vicinity of the initiation codon and the inside of the acyltrans- 
ferase (about 300 bases from the initiation codon): 

Oligonucleotide (GAT- Ncol): 5'-TTCACCATGGAGCAAATCCAAATGGT-3' (SEQ ID No. 18) 

Oligonucleotide (GAT-Scal): 5'-CGAGTCGCCCTCATCAC-3• (SEQ ID No. 19) 

With 1 0 ng of pGAT4 as the template, a PCR reaction was carried out using the above oligonucleotides as the prim- 
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ers. The reaction was carried out using the polymerase chain reaction kit Gene Amp (Takara Shuzo K.K.) for 15 cycles 
with one cycle comprising 95°C for 1 minute, 56°C for 1 minute, and 72°C for 2 minutes. When the reaction product thus 
obtained was run on a 1% agarose gel electrophoresis, a specific DNA fragment of about 300 bp was observed. This 
DNA fragment was collected and cleaved with restriction enzymes Ncol and Aatl. It was then ligated to a 6 kg fragment 
s which was obtained by cleaving pGAT101 with Ncol and Aatl to construct pGAT102. It was confirmed that the nucleotide 
sequence of the PCR-amplified portion was the same as that of pGAT4 after construction of pGAT102. 

(2) Expression of acyltransferase gene in E. coli 

10 pGAT102 was used to transform E. coli MM294 (supE44 hsdR endA1 pro thi) (Meselson and Yuan, Nature, 217: 
1110-, 1968). The host used here need not be specifically defined and may be any £. coli host which can be used as 
the host for transformation, and other strains (such as JM109, DH5. etc.) which are generally used for transformation 
and which are readily available to those skilled in the art can be employed. The method for transforming EL cpji was as 
described by Hanahan (J. Moi. Biol., 166: 557-, 1983). The transformed EL coli was inoculated into 2 ml of LB medium 

15 (trypton 10 g, yeast extract 5 g, sodium chloride 10 g were dissolved in one liter of distilled water and pH was adjusted 
to 7.2 with sodium hydroxide) and incubated at 37°C overnight. 

One ml of the culture liquid was inoculated into 10 mi of M9 medium (sodium hydrogen phosphate 0.6%, potassium 
dihydrogen phosphate 0.3%, sodium chloride 0.5%, ammonium chloride 0.1%, glucose 0.5%, magnesium sulfate 1 
mM, vitamin B1 4ng/ml. pH 7.2) to which were added 0.5% casamino acid and 50 ng/ml of ampicillin, and cultured at 

20 37°C for 3 hours, and then 40 jil of 0.5 M IPTG (the final concentration, 2 mM) was added and culture was continued 
for 5 more hours. After harvesting the cells, they were washed with 30 mM Tris-HCI buffer, pH 7.5, containing 30 mM 
sodium chloride, and then the washed cells were suspended into 1 ml of the same buffer. To the cells were added 1 mg 
of lysozyme, 25 y\ of 0.25 M EDTA, and allowed to stand at 0°C for 30 minutes. The cells were then frozen and thawed 
for three times to disrupt the cells. 

25 After centrifugation at 15,000 rpm for 30 minutes, the supernatant obtained was used as a crude enzyme solution 
and the enzymatic activity thereof was determined in the method for determination of enzymatic activity as set forth in 
Example 1 (3). In the microtiter plate method the acyl group transfer reaction was confirmed in E. coji to which pGAT1 02 
was introduced. Accordingly, they were analyzed by HPLC. As a result it was found that in the EL CQli into which 
pGAT102 was introduced, 18.3 nmol of delphinidin 3-glucosyl 5-caffeoyl glucoside was formed from 24 nmol of delphi- 

30 nidin 3,5-diglucoside and 21 .5 nmol of caffeoyl-CoA. 

Combining this result with the known fact that in anthocyanin of gentian the acyl group is bound to glucose at posi- 
tion 5 and position 3', it was revealed that the acyltransferase encoded by pGAT4 catalyzes the reaction of transferring 
an acyl group to glucose at position 5 of anthocyanin 3,5-diglucoside. 

Furthermore, delphinidin 3,5-diglucoside which was acylated by acyltransferase produced in EL coli have also 

35 shown a stable color development when allowed to stand at room temperature for a prolonged period of time similarly 
to the one acylated by acyltransferase obtained by purifying from gentian. 



40 (1) Expression vector in yeast 

As the expression vector of yeast, pYE22m as described in Japanese Unexamined Patent Publication (Kbkai) No. 
4-228078 was used. 

45 (2) Expression of the acyltransferase gene in yeast 

About 1 .8 kb of DNA fragment obtained by digesting either pGAT4 or pGAT8 at restriction enzyme sites, EcoRI and 
Kpnl. present in each of said vectors was ligated to about 8 kb of DNA fragment obtained by digesting similarly pYE22m 
at EcoRI and Kpnl sites to construct yeast expression vectors pYGAT4 and pYGAT8. pYGAT4 starts translation at the 
50 first methionine, but pYGAT8 which lacks part of 5' end of the isolated cDNA starts translation not at the translation ini- 
tiation methionine of acyltransferase (number of amino acid sequence in the sequence listing SEQ ID No: 1), but at the 
next methionine (number of amino acid sequence in the sequence listing SEQ ID No: 5). 

In these yeast expression vectors, the cDNA encoding acyltransferase is ligated downstream of the promoter for 
glyceraldehyde-3 phosphate dehydrogenase, one of the constitutive yeast promoters, and its transcription has been 
55 regulated by said promoter. 

Using the method by Ho et al. (Ko et al., J. Bacterid., 153: 163-168, 1983), a yeast Saccharomyces cereviciae 
G1315 (Ashikari et al., Appl. Microbiol. Biotechnol. 30. 515-520. 1989) was transformed. The transformed yeast was 
selected based on its recovery of synthetic ability of tryptophan. 

It should be noted that the yeast host as used herein for transformation is not limited, but it may be any strain which 
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displays a tryptophan requirement due to its incomplete TRP1 gene (for example, one commercially available from the 
Yeast Genetic Stock Center; Berkely, CA, USA; Catalogure 7th edition (1991), page 36). 

The transformant obtained was cultured under shaking in 10 ml of Burkholder medium (Burkholder, Amer. J. Bot. 
30: 206-210) containing 1% casamino acid. As a control experiment, the yeast which has spontaneously recovered its 

5 synthetic ability of tryptophan was cultured in a similar manner. 

After havesting the cells, they were washed with the same amount of the cell disruption buffer (30 mM Tris-HCI. pH 
7.5, 30 mM sodium chloride), and suspended further in 1 ml of the same buffer and then was transferred into a 1 .5 ml 
Eppendorf tube. After centrifugation, the supernatant was removed and the precipitated cells were resuspended into 
0.4 ml of the same buffer, to which was added 400 mg of glass beads (Glass Beads 425-600 microns Acid-Wash. 

w Sigma) and shaken vigorously to disrupt the yeast cells. 

The supernatant after centrifugation was used as a crude enzyme solution and the enzymatic activity thereof was 
determined by the method for determination of enzymatic activity as set forth in Example 1 (3). Since acyl group transfer 
reaction was observed by the microtiter plate method in all yeasts into which pYGAT4 and pYGAT8 were introduced, 
they were then analyzed by HPLC. The yeast used as the control did not show any activity of acyl group transfer. 

is The result indicated that 16.6 nmol and 20.9 nmol of delphinidin 3-glucosyl 5-caffeoyl glucoside were formed from 
24 nmol of delphinidin 3,5-diglucoside and 21 .5 nmol of caffeoyl-CoA, respectively in the yeast into which pYGAT4 and 
pYGAT8 were introduced. Both of the proteins which were produced by pYGAR and pYGAT8 had acyl group transfer 
activity though their amino termini were different. 

Furthermore, delphinidin 3,5-diglucoside which was acylated by the acyltransferase produced by the yeast have 

20 shown a stable coloration even when allowed to stand at room temperature for a prolonged period of time similarly to 
the one acylated by the acyltransferase obtained by purifying from gentians. 

Examp le 6. cPNA cloning of the acyltransferase derived from gentian (2) 

25 Among the DNA fragments obtained by digestion of pGAT4 as set forth in Example 3(6), i.e. pGAT4 having the DNA 
as set forth in the sequence listing SEQ ID No. 1 , with restriction enzymes EcoRI and Ndel, two DNA fragments which 
contain the translation region of acyltransferase were collected together and labelled with DIG as in the methods men- 
tioned above. Using this as a probe, the phage of cDNA library from petals of gentians was adsorbed onto a filter 
(Hybond N+, Amersham), which was then regenerated by removing the pigments and the DIG labels attached to the 

30 filter according to the method recommended by the manufacturer (Amersham) and subjected to hybridization in a low 
concentration formamide hybridization buffer (5 x SSC, 30% formamide. 50 mM Tris-HCI. pH 7.5. 1% SDS) at 42°C for 
16 hours. 

After washing at 50°C in the wash solution (5 x SSC, 0.1% SDS), the fitter was allowed to develop color as 
described in Example 3(4). A few dozen clones developed color. Out of the clones which developed color, 12 clones 
35 which did not develop color in Example 3(4) were obtained. The nucleotide sequences of cDNA of these clones were 
determined from the 5" end in the above-mentioned method to find that the nucleotide sequences of 11 clones coin- 
cided with that of pGAT4, but one clone did not, which was designated as pGATl06. 

The entire nucleotide sequence of pGAT106 was determined as described above. The cDNA introduced into 
pGAT106 represented 1622 bases in length, in which an ORF comprising 1440 bases (containing the stop codon) was 
40 found. This is shown in the sequence listing SEQ ID No. 2. For the ORF contained in SEQ ID No. 2, its homology was 
examined with the entire region of the amino acid sequence encoded by pGAT4. The homology was found to be 38%. 

Since the amino acid sequence encoded by pGAT106 is homologous with that of the enzyme encoded by pGAT4, 
it is inferred that the former has a similar enzymatic activity, i.e. an activity of catalyzing acyl group transfer to anthocy- 
anins. The fact that the anthocyanin of gentians has acyl groups at glucose molecules at position 5 and 3' suggests that 
45 pGAT106 catalyzes the enzymatic reaction of transferring an acyl group to position 3' of anthocyanin. The result indi- 
cates that acyltransferases may be different in the positions of sugars of anthocyanins which they transfer an acyl group 
but that the amino acid sequences and the nucleotide sequences encoding them are homologous. 

As hereinabove described, there are many anthocyanins which have acyl groups, and these compounds vary 
widely in the number and position of acyl groups. Accordingly, it is expected that there are a number of enzymes which 
so catalyze acyl group transfer reaction. It is readily inferred that the amino acid sequences of these enzymes have homol- 
ogy with the amino acid sequences of pGAT4 and pGAT1 06 obtained herein. Based on this other acyltransferase genes 
can be obtained. 

55 

Anthocyanins found in a mutant (VM) in which the color of the flower was changed to purple from the original red- 
dish purple of Petunia hvqrida ver. Surf ina purple (Suntory Ltd.). a breed of, was extracted by pulverizing petals thereof 
in liquid nitrogen and then extracting with 50% acetonitrile and 0. 1% TFA. After filtration, the filtrate was separated and 
purified by ODS and ODP reverse phase column chromatographies. When one of the compounds was analyzed in 
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detail by FABMS. 1 H NMR, and 13 C NMR, a new anthocyanin was found. Its structure is shown below: 




That is, the structure was 

3-0-(6-0-(4-0-(4-0-{6-0-caffeoy1-B-D-glucopyranosyl)-coum aroyl)-a-L-ramnosyl)-p-D-glucopyranosyl)-5-0-p-D-glu- 
25 copyr anosyl-malvidin, or an anthocyanin to which two acyl groups are bound. 
In addition, 

3-0-(6-0-(4-0-(4-0-(6-0-coumaroyl-p-D-glucopyranosyl)-cou maroyl)-a-L-ramnosyl)-p-D-glucopyranosyl)-5-0-p-D- 
glucopy ranosyl-malvidin, 

3-0-(6-0-(4-0-{4-0-(6-0-caffeoyl-p-D-glucopyranosyl)-caff eoyl)-a-L-rarmosyl)-p-D-glucopyranosyt)-5-0-p-D-glucop- 
30 yra nosyl-malvidin, 

3-0-(6-0-(4-0-(4-0-(6-0-coumaroyl-p-D-glucopyranosyl)-caf feoyl)-a-L-ramnosyl)-p-D-glucopyranosyl)-5-0-p-D-glu- 
copyr anosyl-malvidin were also detected. The anthocyanins were found to be present in the dark purple petals of Ful- 
con Blue (Sakata Seed Corp.). Old Glory Blue (Bait Seeds), and the like. Thus, anthocyanins having two acyl groups 
conceivably contribute to the dark purple color of petunias. 
35 Accordingly, the foregoing suggests that acyltransferases related to anthocyanins derived from petunia come in two 
types: the enzyme which catalyzes a reaction of transferring coumaric acid or caffeic acid to rutinoside at position 3 of 
anthocyanin, and the enzyme which catalyzes a reaction of transferring coumaric acid or caffeic acid to monoacyl mal- 
vidin via glucose. 

40 Example 8 . cDNA cloning of the acvttransferase derived from petunias 

cDNA portion of pGAT4 described in Example 3(6), i.e. pGAT4 having DNA as set forth in the sequence listing SEQ 
ID No. 1 , was labelled with DIG in the method described above, and the cDNA library of petals of Petunia hvbrida var. 
Old Glory Blue was screened by the plaque hybridization technique. Hybridization and washing were carried out under 
45 the condition similar to the one as set forth in Example 6. 

About 200,000 clones were screened, from which one weakly hybridizing clone was obtained. This clone was des- 
ignated as pPAT5. Determination of the nucleotide sequence revealed that more than one DNA were inserted in pPAT5. 
Thus, there was a sequence similar to that of C-terminal of the protein encoded by pGAT4 and pGAT106 in the reverse 
primer side of the plasmid. Based on the reverse primer, a nucleotide sequence: 5'-AACAGCTATGACCATG-3' (SEQ ID 
so No. 20) was synthesized and the oligonucleotide was designated as the RP primer. 

In order to obtain the full-length cDNA of pPAT5. 100 ng each of the RP primer and the oligo 2 primer, and 10 ng of 
pPAT5 digested with Xhol were subjected to PCR reaction in a final volume of 50 jd. The reaction cas carried out for 20 
cycles with one cycle comprising 95°C for 1 minute. 55°C for 1 minute, and 72°C for 1 minute. The DNA fragment of 
about 600 bp thus obtained was run on an agarose gel electrophoresis and purified by GENECLEAN. After the frag- 
55 ment was enzyme digested with Smal, the DNA fragment of about 400 bp was purified in a similar manner. The DNA 
fragment was labelled with the above-mentioned DIG. 

Using the labelled DNA fragment the above cDNA library of petals of petunias was screened by the plaque hybrid- 
ization technique. Washing after hybridization was carried out in 0.2 x SSC at 65°C for 1 hour. Determination of the 
nucleotide sequence of the plasmid recovered from the done obtained revealed that pPAT48 contained the same 
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sequence as pPAT5 does. This is shown in the sequence listing SEQ ID No. 3. This sequence had a homology of 20% 
and 16% with pGAT4 and pGAT106, respectively, at the level of amino acid sequence. 

Example 9 . Extraction of crude enzyme solutio n from perillas 

5 

Young red leaves were harvested from the plant bodies of Perilla ocimoides var. Akachirimen, and a crude enzyme 
solution was extracted according to the method as set forth in Example 1(2). This was reacted with 50 ul of a mixture 
containing, at a final concentration, 50 mM potassium phosphate, pH 8.5, 0.48 mM delphinidin 3,5-diglucoside. 0.43 
mM caffeoyl-Co A and 20 ul of the enzyme solution at 30°C for 1 0 minutes. 50 \i\ of acetonitrile containing 1 3.8% acetic 

10 acid was added to the reaction mixture to stop the reaction. After centrrf uging at 1 5000 rpm for 5 minutes, a 1 0 ^ aliquot 
of the supernatant was analyzed by HPLC under the following conditions. 

The column used was the YMC-Pack ODS-A (6.0 x 15 cm), and samples were separated under the condition of 
0.1% trifluoroacetjc acid, 21.6% acetonitrile, and a flow rate of 1 ml/min. Detection was carried out at 520 nm. Under 
this condition unreacted delphinidin 3,5-diglucoside was eluted at 3 minutes and the one in which caffeic acid was trans- 

is ferred to position 3 of delphinidin 3,5-diglucoside was eluted at 4.7 minutes, the absorption maximum of said compound 
being 531 nm. 

Modification by caffeic acid was also seen when delphinidin 3-glucoside was used as the substrate. Furthermore, 
when coumaroyl-CoA was used as a donor of acyl group, transfer of a coumaroyl group was observed. It was revealed 
that although natural perillas do not contain delphinidin glucoside as anthocyanin, the acyltransferase of perillas can 
so use delphinidin 3-glucoside and delphinidin 3,5-diglucoside as the acyl group recipient and coumaroyl-CoA as the acyl 
group donor. 

Example 10 . Purification of the acyltransferase derived from perillas 

ss Purification of the acyltransferase derived from perillas was carried out in accordance with the method described 
in Example 2(1). Three kilograms of leaves of perillas was frozen in liquid nitrogen and pulverized frozen in a homoge- 
nizer. The pulverized material was homogenized again in 10 liters of the extraction buffer (100 mM sodium phosphate, 
pH 6.8, 10 mM sodium ascorbate, 5 mM dithiothreitol, 10 uM p-APMSF, 5% (w/v) polyclar SB-100) in a homogenizer. 
This was filtrated with gauze stacked in four layers, and then centrifuged (8,000 rpm, 4°C. 30 minutes). Ammonium sul- 

30 fate was added to the supernatant to a 40% saturation. After dissolution, centrtfugation is repeated under the same con- 
dition. Ammonium sulfate was added to the supernatant to a 70% saturation. After dissolution, centrifugation is 
repeated under the same condition. The precipitate was dissolved in a minimum amount of the desalting buffer (bis Tris- 
HCI, pH 6.3, 1 mM dithiothreitol, 10 uM p-APMSF, 10% glycerol), and then desalted by Sephadex G-25 medium (Phar- 
macia. 9.5 x 45 cm) which had been equilibrated with the same buffer. 

35 The desalted sample was subjected to ion exchange chromatography using Q-Sepharose Fast Flow 26/1 0. A linear 
gradient of sodium chloride from 0 to 0.5 M in the desalting buffer was run at a flow rate of 8 ml/min over 1 hour. The 
active fractions were eluted at NaCI concentrations of about 0.15 to 0.3 M. The active fractions were adsorbed to four 
HiTrap Blue (5 ml) columns connected in a series which had been equilibrated with the desalting solution. After ade- 
quately washing the columns with the same buffer, elution was carried out by a linear gradient of sodium chloride from 

40 0 to 1 M in the desalting buffer (2 hours, flow rate 5 ml/min). The active fractions were eluted at NaCI concentrations of 
0.8 to 0.9 M. These fractions were subjected to chromatography using a hydroxyapatite column (ceramic type II 40 mm; 
Bio-Rad). The column on which a sample had been applied was adequately washed with buffer A (50 mM sodium phos- 
phate, pH 6.8, 1 mM dithiothreitol, 10 jiM p-APMSF, 1 0% glycerol). Then enzyme was eluted with a liner gradient from 
buffer A to buffer B (400 mM sodium phosphate, pH 6.8. 1 mM dithiothreitol. 10 uM p-APMSF, 10% glycerol), which 

45 eluted at about 0.2 M sodium phosphate. This active fractions were used for biochemical characterization of the 
enzyme. 

In a similar manner to when the crude enzyme sample was used, any of cyanidin 3-glucoside, cyanidin 3,5-diglu- 
coside, delphinidin 3-glucoside, and delphinidin 3,5-diglucoside could be used as the acyl group recipient. As the acyl 
group donor coumaroyl-CoA and caffeoyl-CoA could be used. The molecular weight was found to be about 50,000 by 
so SDS-polyacrylamide gel electrophoresis. The isoelectric point was determined to be 5.3 using a Mono-P column (Phar- 
macia). 

Example 1 1 . cDNA cloning of the acyltransferase derived from perillas 

55 By comparing the structures of pGAT4, pGAT106, and pGAT48 which were cloned in Example 3, Example 6, and 
Example 8. respectively, it was found that the amino acid sequence: Asp-Phe-Gly-Trp-Gly-Lys (SEQ ID No. 21) have 
been conserved. Accordingly, it is expected that this structure is also conserved in acyltransferases. Based on this 
sequence, nucleotide sequence: 5'-GA(TC)TT(TC)GGITGGGGIAA-3' (SEQ ID No. 22) was synthesized and this oligo- 
nucleotide was used as an ATC primer. 
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From young leaves of perillas, RNA was extracted in the method as described in Example 3 and a cDNA library was 
also constructed using the ZAP-cDNA synthesis kit (Stratagene). Using 50 ng of double stranded cDNA which was 
formed here as the template and 100 ng each of the ATC primer and the oligo 2 primer, PCR reaction was carried out 
in a final volume of 50 ul using the PCR kit (Takara Shuzo). The reaction was carried out for 25 cycles with one cycle 

5 comprising 95°C for 1 minute, 50°C for 1 minute, and 72°C for 1 minute. A DNA fragment of about 400 bp thus obtained 
was recovered and cloned into a vector using the TA cloning kit (Invitrogen). Determination of the nucleotide sequence 
of the clone obtained revealed that the clone designated as pSAT104 had a high homology with pGAT4. 

Using 10 ng of pSAT104 as the template and 100 ng each of the ATC primer and the oligo 2 primer, PCR reaction 
was carried out using the PCR kit (Takara Shuzo K.K.) in a final volume of 50 jxl. The reaction was carried out for 15 

io cycles with one cycle comprising 95°C for 1 minute, 50°C for 1 minute, and 72°C for 1 minute. Using 1 ul of this product 
and 100 ng each of the ATC primer and the oligo 2 primer, PCR reaction was carried out using the PCR kit in a final 
volume of 50 ul. 

However, 4 ul of the DIG-labelled nucleotide solution (manufactured by Boehringer) was used here as the deoxy- 
nucleotide solution. After the reaction was complete, 5 ul of 3 M sodium acetate and 100 jil of ethanol were added to 

15 carry out ethanol precipitation. The product obtained was used for the subsequent study. 

Using the labelled DNA fragment derived from pSAT104. the cDNA library of leaves of perillas was screened by the 
plaque hybridization technique. Washing was carried out in 1 x SSC at 65°C for 1 hour. Determination of the nucleotide 
sequence of the hybridized clone revealed that-the clones of pSAT206, pSAT207, pSAT208, pSAT209, pSAT210, etc. 
contains the nucleotide sequence of pS AT1 04. When the nucleotide sequence of the 5' ends of these clones were com- 

20 pared with that of pGAT4, every clone had an amino terminal shorter than pGAT4 and none had the initiation codon. 
The nucleotide sequences of 5* ends of pSAT206 and pSAT208. and pSAT209 and pSAT210 were identical. pSAT207 
was shorter than pSAT206 by 6 residues, and pSAT209 was shorter than pSAT206 by 5 residues. 

On the vector pBluescript SK-. these cDNA's are taking such forms that enable them to fuse to the LacZ gene of 
the vector. Out of the above-mentioned clones, pSAT206, pSAT208. and pSAT207 are taking such shapes that enable 

25 them to express as a fusion protein with p-galactosidase, whereas pSAT209 and pSAT210 have shifted frames so that 
they cannot form a fusion protein. pSAT206. pSAT207, pSAT209, and pSAT210 were expressed in E coH. and then 
tested for the enzymatic activity of acyl group transfer to position 3 of glucose using delphinidin 3,5-diglucoside and 
caffeoyl-CoA. The method for inducing expression etc. was carried out in accordance with the method as set forth in 
Example 4. 

30 The E coli 's containing pSAT209 and pSAT210 did not exhibit any enzymatic activity of transferring acyl groups, 
but the E, ca!i containing pSAT206 exhibited the enzymatic activity of acylating 48% of delphinidin 3,5-diglucoside and 
the E. coli containing pSAT207 exhibited a similar enzymatic activity of acylating 24% of said compound. These results 
demonstrated that pSAT206, pSAT207, and the like reveal cloning of the gene having the enzymatic activity of transfer- 
ring acyl groups to glucose at position 3 of anthocyanin of perillas. 

35 Among these clones, the nucleotide sequence derived from cDNA of pSAT208 was determined. This is shown in 
the sequence listing SEQ ID No. 4. The amino acid sequence deduced from the nucleotide sequence exhibited a 
homology of 37%, 29%. and 15% with pGAT4. pGAT106, and pPAT48, respectively. As described hereinbefore, this 
sequence, though not a full-length cDNA, can express active enzymes by providing a suitable initiation codon as a 
fusion gene with LacZ. 

40 By comparing the amino acid sequences of acyltransf erases which were elucidated by the present invention, the 
conserved sequence was clarified. Based on the amino acid sequence of this region, it is possible to clone acyltrans- 
ferases which modify sugars at other positions of anthocyanins. 

Example 12 . cDNA cloning of the acvltransferase d erived from cinerarias 

45 

From petals of Senecio cruentus var. Jupiter Blue (Sakata Seed Corp.), RNA was extracted by the method as set 
forth in Example 3 above and Poly A + RNA was further purified. A cDNA library was constructed using the ZAP-cDNA 
synthesis kit (Stratagene). 

Using 50 ng of double stranded cDNA which was formed here as the template and 100 ng each of the ATC primer 
so and the oligo 2 primer, PCR reaction was carried out in a final volume of 50 ul using the PCR kit (Takara Shuzo). The 
reaction was carried out for 25 cycles with one cycle comprising 95°C for 1 minute, 50°C for 1 minute, and 72°C for 1 
minute. DNA fragments of about 400 bp thus obtained were collected and cloned into a vector using the TA cloning kit 
(Invitrogen). Determination of the nucleotide sequence of clones obtained revealed that a clone designated as pJAT4 
had a high homology with pGAT4. 
55 Then the cDNA library of petals of cinerarias was screened with pJAT4. Several clones were obtained. When the 
amino acid sequences deduced from the nucleotide sequences of the 5' end of these cDNA's were compared with the 
sequence of the protein encoding pGAT4, none of cDNA of the clones of cinerarias were full-length. Among them the 
entire nucleotide sequence of cDNA of the done termed pCAT8 was determined. This is shown in the sequence listing 
SEQ ID No. 5. The amino acid sequence deduced from the nucleotide sequence obtained exhibited a homology of 
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28%. 35%, 16%. and 37% with pGAT4, pGAT106. pPAT48. and pSAT208. respectively. 

Example 13 . Construction of a binary vector containing the gene of the acvltransferase derived from gentian 

5 After the acvltransferase gene of gentians, pGAT4. was completely digested with Kpnl, a DNA fragment of about 

1 .6 kb which is obtained by partial digestion thereof with Xbal was collected. This DNA fragment was subcloned using 
the restriction enzyme recognition sites, Kpnl and Xbal, of pUCl9 to obtain a plasmid pUCGAT4. After pUCGAR was 
completely digested with Bglll, it was partially digested with Sad to collect a DNA fragment of about 0.95 kb. A plasmid 
obtained by ligating this DNA fragment, about 0.75 kb DNA fragment obtained by digestion of pUCGAT4 with Xbal and 

10 Bglll. and a DNA fragment obtained by digestion of plasmid p2l 13G (described, for example, in Aida et al., Acta Horti- 
culture, 392: 219-225. 1995) with Xbal and Sacl was designated as pBEGA4. This plasmid is a binary vector, and the 
gentian acyl transferase cDNA is under the control of the cauliflower mosaic virus 35S promoter having enhancers and 
the nopaline synthase terminator within plant cells. It also has a translation enhancer called fl sequence at the 5' end 
of cDNA of the gentian acvltransferase. It is noted that the promoter and terminator as used herein are not limited to 

is those just described, but they may be a constitutive promoter or a promoter which specifically works in petals such as 
the promoter of the gene of chalcone synthase. 

Example 14 . Introduction of the acvltransferase gene derived from gentian into plants 

20 pBEGA4 was introduced into Aorobacterium tumefacience strain Agio (Lazo et al., Bio/Technology, 9: 963-967, 
1991) by the method described in Plant Molecular Biology Manual (Kluwer Academic Publishers). On the other hand, 
by culturing a shoot apex of a rose var. Lavende in a solid medium in which BA (6-benzyl aminopurine) 2.25 mg/l, GA3 
(gibber ellic acid) 3.46 mg/l, sucrose 30 g/l. and Gelian Gum 2 g/l were added to the MS medium to obtain Embriogenic 
Callus (EC). An overnight culture of the above AG10 strain in the LB medium was suspended to the MS liquid medium 

25 containing 20 ng/ml of acetosyringone to adjust to a concentration of 5 x 10° cells/ml. After immersing the EC in this 
bacterial culture liquid, excess liquid was wiped clean by sterilized filter paper. By transplanting and culturing in the MS 
medium in which BA 2.25 mg/l, GA3 0.35 mg/l, sucrose 30 g/l, and Gellan Gum 2 g/l were added to the MS medium, a 
transformarrt can be obtained. From the kanamycin resistant callus obtained, RNA was obtained using trizol (LHetec 
Oriental). Using this RNA as the template, and nucleotide GAT-1 : 5'-TGGCAACTGTCTTGCGTCATG-3' (SEQ ID No. 

30 23) and nucleotide GAT-2: 5'-CCATGTCAGGTGTGAGGTTCAAC-3' (SEQ ID No. 24) synthesized based on the nucle- 
otide sequence of pGAT4 as the primer, RT-PCR reaction was carried out using the Access RT-PCR System 
(Promega). Using the same RNA as the template, and oligonucleotide Kan-1 : 5'-ATCGTTTCGCATGATTGAAC-3' (SEQ 
ID No. 25) and oligonucleotide Kan-2: 5'-TCAGAAGAACTCGTCAAGAA-3' (SEQ ID No. 26) synthesized based on the 
nucleotide sequence of nptll on the binary vector as the primer, RT-PCR reaction was similarly carried out. The reaction 

35 was carried out for 40 cycles with one cycle comprising 94°C for 30 seconds. 60°C for 1 minute, and 68°C for 2 minutes. 
From the callus of the transformant a band derived from pGAT4 and a band derived from nptll were observed, but from 
the callus of the non-transformant no bands corresponding to the above were observed. The result indicates that the 
gene of acyltransferase of gentians could be introduced into the rose. 

Construction of the binary vector mentioned above and its transformation into a plant are not limited to the gene of 

40 acyltransferase of gentians contained in pGAT4, but other acyltransferases can be introduced into plants and genes 
thereof can be expressed in plants. As a species of a plant, a rose was described hereinabove. But since methods of 
transformation have been reported for many other plants (for example, carnations, chrysanthemums, tobaccos, petu- 
nias, gerberas. petunias, etc.). acyltransferase could be introduced into many plant species by employing published 
methods. 

45 

Example 15 . Synthesis of the full-length cDNA of the acvltr ansferase derived from perillas 

The cDNA of acyltransferase gene of perillas, pSAT208. encodes active enzymes as described above, but it was 
not full-length. Accordingly, a full-length cDNA containing the initiation codon was synthesized based on the nucleotide 
so sequence of acyltransferase gene of gentians. pGAT4. Thus, the DNA shown below was synthesized. The amino acid 
sequence encoded by the DNA is also shown. The first underline means a BamHI recognition sequence, and the next 
underline means a sequence contained in pSAT208. Behind the BamHI recognition sequence is inserted a sequence 
AACA which often occurs immediately before the translation initiation codon in plants. 

55 
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5 ' GGGATCCAACA ATG GAG CAA ATC CAA ATG GTG G CC GTG ATC GAA ACG TGT AGA 3 * 
Met Glu Gin He Gin Met Val Ala Val He Glu Thr Cys Arg 
5 (SEQ ID No. 27) 

PCR reaction was carried out in a final volume of 50 fil containing 100 ng each of this primer and -20 primer: 5'- 
GTAAAACGACGGCCAT-3' (SEQ ID No. 28), and 1 0 ng of pSAT208. The reaction was carried out for 1 5 cycles with one 

10 cycle comprising 95°C for 1 minute, 55°C for 1 minute, and 72°C for 2 minutes. After the reaction, DNA fragments were 
recovered from the reaction mixture using GENECLEAN (Biol01) by the method recommended by the manufacturer. 
After digesting the recovered DNA with BamHI and EcoRI, followed by agarose gel electrophoresis, a DNA fragment of 
about 200 bp was recovered. This DNA fragment was ligated to a DNA fragment of about 3.3kb obtained by digesting 
pSAT208 with EcoRI and the plasmid obtained was termed pSATF208. The nucleotide sequence of this plasmid was 

15 determined from 5' end of the cDNA to confirm the nucleotide sequence. 

Example 16 . Expression of the gene of the acvltransferase derived from oerillas in veast 

In accordance with the method described in Example 5, pSATF208 was expressed in yeast and tested for enzy- 
20 matic activity. Thus, a plasmid obtained by ligating a DNA fragment of about 8 kb obtained by digestion of pYE22m with 
BamHI and Sail with a DNA fragment of about 8 kb obtained by digestion of pSATF208 with BamHI and Sail was termed 
pYSAT208. 

Yeast G1 315 was transformed with pYSAT208 and the activity of acvltransferase of the resulting transformant was 
determined. As a result, in the yeast into which pYSAT208 was introduced, formation of 10 nmole of delphinidin 3-catfe- 
25 oylglucoside 5-glucoside from 24 nmol of delphinidin 3,5-diglucoside and 21.5 nmol of caffeoyl-CoA was observed. 
Thus, it was confirmed that the synthesized full-length cDNA cotained in pSATF208 encodes the activity of acvltrans- 
ferase. 

Example 17 . Construction of a binary vector containing the acvltransferase oene derived from perillas 

30 

Plasmid pE12fX3US is one in which the expression unit of GUS gene on plasmid p21 13G (Aida et al., Acta Horti- 
culture, 392: 219-225. 1995) has been inserted into the Hindlll and EcoRI recognition sites of pUC19. After pE120GUS 
was digested with Sad and blunt-ended using the DNA blunting kit (Takara Shuzo), it was ligated to an Xhol linker (Toy- 
obo). The plasmid obtained which has an Xhol linker inserted thereinto was termed pE12fX3USx. A plasmid obtained 
35 by ligating about 2.8 kb DNA fragment obtained by digestion of this plasmid with Hindlll and EcoRI to pBin19 digested 
with Hindlll and EcoRI was designated pBEGUSx. A plasmid obtained by ligating a DNA fragment of about 11 kb 
obtained by digestion of BEGUSx with BamHI and Xhol to a DNA fragment obtained by digestion of pSATF208 with 
BamHI and Xhol was termed pBESA208. On this plasmid the acyttransferase of perillas is under the control of the cau- 
liflower mosaic virus 35S promoter having enhancers and the nopaline synthase terminator. 

40 

Example 18. Introduction of the acyttransferase gene pf perillas into plants 

pBESA208 was introduced into Agrobacterium tumefacience strain Ag10 (Lazo et al., Biotechnology, 9: 963-967, 
1991) by the method described in Plant Molecular Biology Manual (Kluwer Academic Publishers). Agio strain trans- 
45 formant was used to transform petunia Falcon red (Sakata Seed Co.), Baccarat red (Sakata Seed Co.). and Titan red 
(Sakata Seed Co.) by the method described in Plant Molecular Biology Manual (Kluwer Academic Publishers). These 
Petals of petunia contain cyanidin-3-glucoside as a major arrthocyanin. 

. Transformation was also carried out to a rose var. Lavande by the above-mentioned method. 

so Example 19 . Synthesis of the full-length cDNA of the acyltransfera.se derived from cinerarias 

The cDNA of cineraria acyltransferase gene. pCAT208, as hereinabove described, was not full-length. Accordingly, 
a full-length cDNA containing the initiation codon was synthesized based on the nucleotide sequence of the gentian 
acyltransferase gene. pGAT4. Thus, the DNA shown below was synthesized. The amino acid sequence encoded by the 
55 DNA is also shown. The first underline means a BamHI recognition sequence for cloning, and the next underline means 
a sequence contained in pCAT208. 
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5 ' G GGATCCA ACA ATG GAG CAA ATC CAA ATG G TG AAC ATT CTC GAA C 3 ' 
Met Glu Gin He Gin Met Val Asn He Leu Glu 
(SEQ ID No. 29) 



to PCR reaction was carried out in a final volume of 50 nl containing 100 ng each of this primer and -20 primer, and 
10 ng of pCAT8. The reaction was carried out for 15 cycles with one cycle comprising 95°C for 1 minute. 55°C for 1 
minute, and 72°C for 2 minutes. After the reaction was over, DNA fragments were recovered from the reaction mixture 
using GENECLEAN (Biol01) by the method recommended by the manufacturer. After digesting the recovered DNA with 
BamHI and Mval. followed by agarose gel electrophoresis, a DNA fragment of about 200 bp was recovered. This DNA 

is fragment was ligated to a DNA fragment of about 1 .3kb obtained by digesting pCAT8 with Mval and Xhol and plasmid 
pBluescript SK- digested with BamHI and Xhol, and the plasmid obtained was termed pCATF208. The nucleotide 
sequence of this plasmid was determined from the 5' end of the cDNA to confirm the nucleotide sequence. 

Exa m ple 2 Q . Cloning of cDNA encoding the acyltransferase derived from lavenders 

20 

A cDNA library derived from petals of lavender of the perilla family, Lavandula anaustifolia . was constructed by the 
method as set forth in Example 3. and screened by the plaque hybridization technique detailed in Example 3. About 
300.000 clones were screened. Thus, the probe used was obtained by carrying out PCR reaction in a final volume of 
50 pJ using 100 ng each of the synthetic nucleotide Rl primer: 5'-CTCGG AGGAATTCGGC ACGAC-3 ' (SEQ ID No. 30) 

ss and the oligo 2, 1 0 ng of pSAT208. and a D IG-labelled nucleotide as the nucleotide. The reaction was carried out for 25 
cycles with one cycle comprising 95°C for 1 minute, 50°C for 1 minute, and 72°C for 2 minutes. 

The labelled cDNA fragment was added to the hybridization solution and hybridization was carried out at 37°C for 
further 16 hours. The filter was washed with the washing solution (5 x SSC, 1% SDS) and then an enzymeimmu- 
noassay (Boehriner Mannheim) using the DIG-specific antibody labelled with alkaline phosphatase was carried out to 

30 detect positive clones by color development using 5-bromo-4-chloro-3-indolyl phosphate and nitroblue tetrazolium salt. 
The detection method used was as set forth in the manufacturer's instructions. 

As a result, one positive clone was obtained. This cDNA was rescued in the form which employs plasmid pBlue- 
script SK- as a vector from the form which employs X phage as the vector using the method recommended by the man- 
ufacturer. A plasmid was extracted from the clone obtained and was termed pLARTI. As described above, the 

35 nucleotide sequence in the vicinity of the 5' end of cDNA of pLAT1 was determined using the ABI373 DNA Sequencer 
(Perkin Elmer) by the pigment deoxy sequence method with the f luorogenic pigment recommended by the manufac- 
turer. The amino acid sequence deduced from the nucleotide sequence thus obtained has a high homology with the 
amino acid sequence of acyltransf erase of perillas and gentians, suggesting that pLAT1 encodes the acyltransferase of 
lavenders. But. the amino acid sequence encoded by pl_AT1 is shorter than that of acyltransferase of perillas or gen- 

40 tians and it is, conceivably, not long enough to encode the entire length of acyltransferase. Accordingly, using the cDNA 
fragment of pLAT1 labelled with DIG as the probe, the cDNA library of lavenders was screened under the same condi- 
tion mentioned above. The probe was labelled by the PCR reaction in a final volume of 50 nl containing about 1 ng of 
pLAT1 plasmid as the template, 500 ng each of the Rl primer described below and the oligo 2. and 8 jil of dNTP-labelled 
mixture (Boehringer). The PCR reaction was carried out for 25 cycles with one cycle comprising 95°C for 1 minute. 

45 42°C for 2 minutes, and 72°C for 3 minutes, and the reaction was kept at 72°C for 7 more minutes in order to perfect 
the elongation reaction. The plaque hybridization was carried out by the method described above except that the con- 
centration of formamide in the hybridization buffer was 50% and that the filter was washed with 2 x SSC and 1% SDS. 
The nucleotide sequence in the vicinity of the 5' end of cDNA of the positive clone obtained was determined as 
described above, and a clone. pl_AT21 . which is 1 1 bp longer than pLAT1 was obtained. This is shown in the sequence 

so listing SEQ ID No. 6. However, pLAT21 did not contain the methionine initiation codon either and was not long enough 
to encode the entire length. 

Example 21 . Synthesis of the full-lenoth cDNA of the acyltransferase derived from lavenders 

55 Since the cDNA, pLAT21 , which is considered to encode the acyltransferase of lavenders does not contain the 
methionine initiation codon. the methionine initiation codon must be added to the 5' end of the cDNA in order to permit 
its expression in yeast. Accordingly, using a primer as described below. PCR reaction was carried out to synthesize a 
fragment in which the methionine initiation codon has been added to the 5' end of pLAT21. The primer LAT-ATG is 
designed so that it contains, in addition to 20 nucleotide sequences at the 5' end of pLAT21, the methionine initiation 
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codon. the conserved sequence AACA for gene expression in plant which is believed to be present adjacent to the 
upstream thereof and the restriction enzyme BamHI recognition site required for ligation to a yeast expression vector in 
the direction of 5' upstream to 3'. The LAT-ATG primer (SEQ ID No. 31): 

5'-AGTC££AT££AACA A TG ACC AC C CTC CTC GAA TCC 3' 
Thr Thr Leu Leu Glu Ser 



PCR reaction was carried out in a final volume of 50 ul containing about 100 ng of the pLAT21 plasmid as the tem- 
plate, and 500 ng each of the LAT-ATG primer and the oligo 2 primer. PCR reaction was carried out for 10 cycles with 
one cycle comprising 95°C for 1 minute, 42°C for 2 minutes, and 72°C for 3 minutes, and the reaction was kept at 72°C 
for 7 more minutes in order to perfect the elongation reaction. The DNA fragment thus obtained was cleaved with 

15 BamHI and EcoRI and a DNA fragment of about 550 bp was recovered. This DNA fragment was subcloned into the 
BamHI and EcoRI sites of the plasmid vector plasmid pBluescript SK- and termed pLATPCR11. The nucleotide 
sequence of pLATPCR11 was determined as mentioned before, and it was confirmed that this PCR-amplified DNA 
fragment had the same sequence as that of from the 5' end to EcoRI site of pLAT21 cDNA. and contained the methio- 
nine initiation codon in the LAT-ATG primer and the conserved sequence for gene expression in plants, and the restric- 

20 tion enzyme BamHI recognition site required for ligation to a yeast expression vector. 

Furthermore, the entire nucleotide sequence of pLAT21 was determined in a similar method to the one used to 
determine the nucleotide sequence of cDNA of pGAT4. The amino acid sequence expected to be encoded by this 
cDNA had a homology of 69%. 38%, 37%, 37%, and 19% with pSAT208, pGAT4, pGAT8. pGAT106. and pPAT48, 
respectively. 

25 

Example 22 . Expression of the acvltransferase aene derived from lavenders in veast 

A plasmid obtained by ligating a DNA fragment of about 550 bp cleaved out from pLATPCR1 1 with BamHI and 
EcoRI, a DNA fragment of about 1 kb obtained by cleavage of pLAT21 with EcoRI and Xhol. and a DNA fragment of 
30 about 8 kb obtained by cleavage of pYE22m with BamHI and Salt was designated as pYELAT21. As hereinabove 
explained, yeast G1315 was transformed with this plasmid and the activity of acyltransferase was determined. 

As a result, in the yeast into which pYEI_AT21 was introduced, formation of 19.9 nmol of delphinidin 3-caffeoylglu- 
coside 5-glucoside from 24 nmol of delphinidin 3,5-diglucoside and 21.5 nmol of caffeoyl-CoA was observed. Thus, it 
was confirmed that the synthesized full-length cDNA cotained in pYELAT21 encodes the activity of acyltransferase. 

35 

Example 23 . Construction of a binary vector containing the acvltransferase oene derived from lavenders 

A plasmid obtained by ligating a DNA fragment of about 550 bp cleaved out from pLATPCR1 1 with BamHI and 
EcoRI, a DNA fragment of about 1 kb obtained by cleavage of pLAT21 with EcoRI and Xhol, and a DNA fragment of 
40 about 1 1 kb obtained by digestion of pBEGUSx with BamHI and Xhol was designated as pBELA1 1. As hereinabove 
explained, this was transformed into Agrobacterium tumefacience strain Ag1 0 and was supplied for transformation of 
petunias and roses. 

Example 24 . Construction of antibody against acvltransferase 

45 

As a means to obtain the gene of an enzyme whose amino acid sequence is similar to that of the desired enzyme, 
there is mentioned a method in which the cDNA library of the expression form is screened by antibody to an enzyme. 
In this case, an antibody against the acyttransf erase encoded by pGAT4 of gentians was produced. Similarly, it is pos- 
sible to produce antibodies to other acyHransf erases. 
so First, using the Bulk and RediPack GST Purification Modules (pharmacia Biotech), the GAT4 protein was 
expressed in large quantities using E. cgH. from which the antibody was purified. 

( 1 ) Construction of expression plasmid 

55 pGEX-4T-1 was used to express the acyltransferase gene in E. cofi. Using this pGEX-4T-1 a fusion protein with gl u- 
tathione S-transferase can be prepared, which is purified efficiently using an affinity column of glutathione S-trans- 
ferase. 

After pGEX-4T-1 was digested with Smal and Xhol, it was blunt-ended using the DNA blunting kit (Takara Shuzo). 
The DNA fragment of about 4.9 kb thus obtained was dephosphorylated using alkaline phosphatase BAP C75 (Takara 
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Shuzo). A DNA fragment of about 1 .6 kb obtained by digestion of pGAT4 with Smal and Kpnl present in said vector was 
blunt-ended as described before, and was recombined with the above-mentioned the blunt-ended site after digestion of 
pGEX-4T-1 with Smal and Xhol to construct pGEXGAT4. By digesting with EcoRI and Bglll, rt was confirmed that the 
cDNA and glutathione S-transferase on pGAT4 were in the same direction. 

5 

(2) Expression of acyltransferase in ILeqH 

E. soli strain JM109 was transformed with pGEXGAT4. Tansformation of E col was carried out by the method of 
Hanahan (J. Mol. Biol. 166: 557-, 1983). The transformed E S2li was inoculated to 50 ml of 2 x YT medium (tryptone 

10 16 g, yeast extract 10 g, and sodium chloride 5 g were dissolved into one liter of distilled water, and then pH was 
adjusted to 7.0 with sodium hydroxide) containing ampicillin (100 \iq/\) and 2% glucose, and then cultured overnight at 
37°C . Forty ml of the culture was inoculated into 40 ml of 2 x YT medium containing ampicillin (100 ng/1) and 2% glu- 
cose, followed by incubation at 37°C for 3 hours, to which 440 nl of IPTG (final concentration 10 mM) was added and 
cultured for 5 more hours. After harvesting, the cells were suspended to 100 ml of 1 x PBS (sodium chloride 8.2 g, 

is potassium chloride 2.0 g, disodium hydrogen phosphate 1 .43 g, and potassium dihydrogen phosphate 2.45 g were dis- 
solved in one liter of distilled water) containing 10 of APMSF. After the suspension was disrupted by sonication, 5 
ml of 20% Triton X-100 was added (final concentration 1%). After shaking in ice for 30 minutes, it was centrifuged at 
1 2,000 rpm for 1 0 minutes. The precipitate obtained was suspended to 12 ml of 6 M urea, to which was added an equal 
amount of 2 x SDS sample buffer and treated at 90°C for 5 minutes to prepare a sample. 

20 This sample (0.8 m!) was separated on disk gel electrophoresis (separating gel 7.5% acrylamide, stacking gel 5% 
acrylamide: ATTO BIO PHORESISI III) and collected in aliquots of 0.8 ml. Each fraction was analyzed on SDS-polyacr- 
ylamide gel electrophoresis (separating gel 10% acrylamide, stacking gel 4.5% acrylamide). The result indicated that 
there was a fraction in which a protein having a molecular weight of about 75,000 corresponding to the size of a fusion 
protein of the acyltransferase and glutathione S-transferase encoded by pGAT4 was present as a single protein. 

25 This fraction (3.2 ml) was concentrated by Centricon 1 0 (Amicon) to obtain about 0.3 jig of the fusion protein. Using 
this sample, antibody was produced using BALB/C mice by the conventional method. Using this antibody, a homologs 
of acyltransferase can be obtained. Industrial Applicability 

As hereinabove described, in accordance with the present invention, aromatic acyltransferase derived from gen- 
tians was purified, the cDNA of said enzyme was cloned, and the nucleotide sequence of said cDNA was determined. 

30 Furthermore, by expressing the activity in E. coK and yeast, the separated cDNA was confirmed to be identical with the 
one encoding aromatic acyltransferase. 

Thus, by connecting the cDNA according to the present invention to a suitable plant expression vector and then 
introducing rt into a plant, it became possible to utilize acylation reaction in order to control the color of flowers. 

Furthermore, by utilizing the present enzymatic activity, it is possible to modify the structures of anthocyanins in 

35 plants or in vitro in order to provide more stable anthocyanins. 



45 
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SEQUENCE LISTING 

SEQ ID NO: 1 
SEQUENCE LENGTH: 1703 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA to mRNA 
HYPOTHETICAL SEQUENCE: No 
ANTI -SENSE: No 
ORIGINAL SOURCE 

ORGANISM: Gentiana triflora var. japonica 

TISSUE TYPE: petal 
IMMEDIATE SOURCE 

LIBRARY: cDNA library 

CLONE : pGAT4 
SEQUENCE DESCRIPTION 

TCATT ATG GAG CAA ATC CAA ATG GTG AAG GTT CTT GAA AAA TGC CAA 
Met Glu Gin He Gin Met Val Lys Val Leu Glu Lys Cys Gin 
-11 5 10 

GTT ACA CCA CCA TCT GAC ACA ACA GAT GTC GAG TTA TCG CTA CCG GTA 
Val Thr Pro Pro Ser Asp Thr Thr Asp Val Glu Leu Ser Leu Pro Val 

15 20 25 

ACA TTC TTC GAT ATC CCC TGG TTG CAC TTG AAT AAG ATG CAG TCC CTT 
Thr Phe Phe Asp He Pro Trp Leu His Leu Asn Lys Met Gin Ser Leu 
30 35 40 45 

CTG TTT TAC GAC TTT CCG TAC CCA AGA ACA CAT TTC TTG GAC ACT GTT 
Leu Phe Tyr Asp Phe Pro Tyr Pro Arg Thr His Phe Leu Asp Thr Val 

50 55 60 

ATC CCT AAT CTT AAG GCC TCT TTG TCT CTC ACT CTA AAA CAC TAC GTT 
He Pro Asn Leu Lys Ala Ser Leu Ser Leu Thr Leu Lys His Tyr Val 

65 70 75 

CCG CTT AGC GGA AAT TTG TTG ATG CCG ATC AAA TCG GGC GAA ATG CCG 
Pro Leu .Ser Gly Asn Leu Leu Met Pro He Lys Ser Gly Glu Met Pro 

80 85 90 

AAG TTT CAG TAC TCC CGT GAT GAG GGC GAC TCG ATA ACT TTG ATC GTT 
Lys Phe Gin Tyr Ser Arg Asp Glu Gly Asp Ser He Thr Leu He Val 
95 100 10S 
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no 

GAT 
Asp 

AGG 
Arg 

ACC 
Thr 

TCA 
Ser 

TAT 
Tyr 
190 
CTT 
Leu 

GAA 
Glu 

TTT 
Phe 

CTC 
Leu 

AGA 
Arg 
270 
GGA 
Gly 



I GAG TCT 
i Glu Ser 

TCC AAT 
Ser Asn 

ACC ATG 
Thr Met 

GTT TTT 
Val Phe 
160 
ATT GCA 
He Ala 
175 

ATT AAC 
He Asn 

CCA TCT 
Pro Ser 

ACA TTT 
Thr Phe 

GGA AGC 
Gly Ser 

TCC CTT 
Ser Leu 
255 

GGA TCC 
Gly Ser 

TAC GTA 
Tyr Val 



GAC CAG GAT 
Asp Gin Asp 
115 

GAT TTG CAT 
Asp Leu His 

130 
CAA GAC TAT 
Gin Asp Tyr 
145 

CCT AAC CGT 
Pro Asn Arg 

GAT GCT AAA 
Asp Ala Lys 

AAA TTT GGG 
Lys Phe Gly 
195 

TTC GAT AGA 
Phe Asp Arg 

210 
TGG AAC GAA 
Trp Asn Glu 
225 

AAA CCC CCT 
Lys Pro Pro 

GCT GAA ATC 
Ala Glu He 

GAA CCG ACA 
Glu Pro Thr 
275 

TGG ACA TGC 
Trp Thr Cys 
290 



TTT GAC TAC 
Phe Asp Tyr 

GGC CTT TTT 
Gly Leu Phe 

AAA GTG ATC 
Lys Val He 
150 

GGC ATA GCC 
Gly He Ala 

165 
AGT TTT GTA 
Ser Phe Val 
180 

AAA GAC GCG 
Lys Asp Ala 

TCG ATA ATC 
Ser He He 

ATG CAA GAT 
Met Gin Asp 
230 

CGA TTC AAC 
Arg Phe Asn 

245 
CAG AAG CTA 
Gin Lys Leu 
260 

ATA CGT GTA 
He Arg Val 

ATG GTC AAA 
Met Val Lys 



CTT AAA 
Leu Lys 
120 
TAT GTT 
Tyr Val 
135 

CCG CTC 
Pro Leu 

GTG GCT 
Val Ala 

ATG TTC 
Met Phe 

GAC TTG 
Asp Leu 
200 
AAA GAT 
Lys Asp 
215 

GTT CTT 
Val Leu 

AAG GTA 
Lys Val 

AAG AAC 
Lys Asn 

ACG ACG 
Thr Thr 
280 
TCA AAA 
Ser Lys 
295 



GGT CAT 
Gly His 

ATG CCA 
Met Pro 

GTA GCC 
Val Ala 

CTG ACG 
Leu Thr 
170 
ATC AAT 
He Asn 
185 

TTG TCC 
Leu Ser 

CTG TAT 
Leu Tyr 

GAA ATG 
Glu Met 

CGA GCT 
Arg Ala 
250 
AAA GTA 
Lys Val 
265 

TTC ACA 
Phe Thr 

GAT GAC 
Asp Asp 



CAA CTG GTA 
Gin Leu Val 
125 

CGG GTT ATA 
Arg Val He 
140 

GTG CAA GTA 
Val Gin Val 
155 

GCA CAT CAT 
Ala His His 

GCT TGG GCC 
Ala Trp Ala 

GCG AAT CTT 
Ala Asn Leu 
205 

GGC CTA GAG 
Gly Leu Glu 

220 
TTC TCT AGA 
Phe Ser Arg 
235 

ACA TAT GTC 
Thr Tyr Val 

CTG AAT CTC 
Leu Asn Leu 

ATG ACG TGT 
Met Thr Cys 
285 

GTC GTA TCA 
Val Val Ser 
300 
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GAG GAA TCA TCG AAC GAC GAA AAT GAG CTC GAG TAC TTC AGT TTT ACA 
Glu Glu Ser Ser Asn Asp Glu Asn Glu Leu Glu Tyr Phe Ser Phe Thr 

305 310 315 

GCG GAT TGC CGA GGA CTT CTG ACG CCC CCG TGT CCG CCT AAC TAC TTT 
Ala Asp Cys Arg Gly Leu Leu Thr Pro Pro Cys Pro Pro Asn Tyr Phe 

320 325 330 

GGC AAC TGT CTT GCG TCA TGC GTT GCA AAA GCA ACA CAT AAA GAG TTA 
Gly Asn Cys Leu Ala Ser Cys Val Ala Lys Ala Thr His Lys Glu Leu 

335 340 345 

GTT GGG GAT AAA GGG CTT CTT GTT GCA GTT GCA GCT ATT GGA GAA GCC 
Val Gly Asp Lys Gly Leu Leu Val Ala Val Ala Ala He Gly Glu Ala 
350 355 360 365 

ATT GAA AAG AGG TTG CAC AAC GAA AAA GGC GTT CTT GCA GAT GCA AAA 
He Glu Lys Arg Leu His Asn Glu Lys Gly Val Leu Ala Asp Ala Lys 

370 375 380 

ACT TGG TTA TCG GAA TCT AAT GGA ATC CCT TCA AAA AGA TTT CTC GGG 
Thr Trp Leu Ser Glu Ser Asn Gly He Pro Ser Lys Arg Phe Leu Gly 

385 390 395 

ATT ACC GGA TCG CCT AAG TTC GAT TCG TAT GGT GTA GAT TTT GGA TGG 
He Thr Gly Ser Pro Lys Phe Asp Ser Tyr Gly Val Asp Phe Gly Trp 

400 405 410 

GGA AAG CCT GCA AAA TTT GAC ATT ACC TCT GTT GAT TAT GCA GAA TTG 
Gly Lys Pro Ala Lys Phe Asp He Thr Ser Val Asp Tyr Ala Glu Leu 

415 420 425 

ATT TAT GTG ATT CAG TCC AGG GAT TTT GAA AAA GGT GTG GAG ATT GGA 
He Tyr Val He Gin Ser Arg Asp Phe Glu Lys Gly Val Glu He Gly 
430 435 440 445 

GTA TCA TTG CCT AAG ATT CAT ATG GAT GCA TTT GCA AAA ATC TTT GAA 
Val Ser Leu Pro Lys He His Met Asp Ala Phe Ala Lys He Phe Glu 

450 455 460 

GAA GGC TTT TGC TCT TTG TCA TAGTCTCTTT AATAGAACCA TATTTGCTGC 
Glu Gly Phe Cys Ser Leu Ser 
465 468 

AATAAAGTAC CAAGTCCTTT AGTAACACTA CACCAAACCC TACTTTCGAG GCGGGAACAC 
CACAACGAGG TTCAATCACT AGAAGGTTGT ACTTCATAAA TTCCAGAGGT CGAATATACA 
CCGTTGTCCT CTGAAAAGTT GAACCTCACA CCTGACATGG TGTTACGATA GGTATTGTAT 
AATGCCATTA TATACTTCCA TAAAGTATCC TATGCAATAG AGAACATGTT ATGTGTTAAA 



1502 
1562 
1622 
1682 
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AAAAAAAAAA AAAAAAAAAA A 

SEQ ID NO: 2 

SEQUENCE LENGTH: 1622 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS : double 

TOPOLOGY: linear 

MOLECULE TYPE: cDNA to mRNA 

HYPOTHETICAL SEQUENCE: No 

ANTI-SENSE : No 

ORIGINAL SOURCE 

ORGANISM: Gentiana triflora var. japonica 

TISSUE TYPE: petal 
IMMEDIATE SOURCE 

LIBRARY: cDNA library 

CLONE: pGAT106 
SEQUENCE DESCRIPTION 

GAACCATTGA ATCCAATTAA TCTGATTTAT TAAG ATG GCA GGA AAT TCC GAG 
Met Ala Gly Asn Ser Glu 
1 5 
GAT ATC AAA GTT CTT GAG AAA TGC CGT GTT GCG CCA CCA CCG GAC GCC 
Asp lie Lys Val Leu Glu Lys Cys Arg Val Ala Pro Pro Pro Asp Ala 

10 15 20 

GTC GCC GAG TTT ACA GTC CCA CTG TCG TTT TTC GAC ATG CGA TGG TTG 
Val Ala Glu Phe Thr Val Pro Leu Ser Phe Phe Asp Met Arg Trp Leu 

25 30 35 

ATC TCT GAT GCA GAA CAC CAT CTG CAT TTC TAC AGA TTC CGC CAT CCT 
He Ser Asp Ala Glu His His Leu His Phe Tyr Arg Phe Arg His Pro 

AO 45 50 

TGT CCC AAC TCT AAA TTT ATC ATT TCA TCC ATT AAA TCG TCC CTT TCC 
Cys Pro Asn Ser Lys Phe He He Ser Ser He Lys Ser Ser Leu Ser 
55 60 65 70 

CTT GTT CTC AAA CAC TTT CTT CCG TTA GCC GGG AAT TTG ATT TGG CCG 
Leu Val.Leu Lys His Phe Leu Pro Leu Ala Gly Asn Leu He Trp Pro 

75 80 85 

GTA GAT TCC TCC GAT AGA ATG CCG GAG TTG CGT TAC AAG AAA GGG GAC 
Val Asp Ser Ser Asp Arg Met Pro Glu Leu Arg Tyr Lys Lys Gly Asp 
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TCC GTT TCT TTA ACA ATT GCA GAA TCG AGC ATG GAT TTT GAT TAT CTC 
Ser Val Ser Leu Thr lie Ala Glu Ser Ser Met Asp Phe Asp Tyr Leu 

105 110 115 

GCC GGA GAT CAT CAG AGG GAT TCT TAT AAA TTC AAC GAT TTG ATT CCG 
Ala Gly Asp His Gin Arg Asp Ser Tyr Lys Phe Asn Asp Leu lie Pro 

120 125 130 

CAG CTG CCA GAA CCG ATT GTA ACC TCC GGC GAC GAA GTA TTA CCA CTT 
Gin Leu Pro Glu Pro He Val Thr Ser Gly Asp Glu Val Leu Pro Leu 
135 1A0 145 150 

TTT GCT TTA CAG GTG ACG GTG TTC TCC AAC ACC GGT ATA TGC ATT GGA 
Phe Ala Leu Gin Val Thr Val Phe Ser Asn Thr Gly He Cys He Gly 

155 160 165 

CGC AAT CTT CAT CAA GTT CTT GGT GAT GCC AGT TCT TTT CTG CAT TTT 
Arg Asn Leu His Gin Val Leu Gly Asp Ala Ser Ser Phe Leu His Phe 

170 175 180 

AAT AAA TTA TGG GTT TTG GTT GAC AAA TCC AAT GGA GAT TCA TTA AAG 
Asn Lys Leu Trp Val Leu Val Asp Lys Ser Asn Gly Asp Ser Leu Lys 

185 190 195 

TTC CTT CCA CTT TCT TCT CTA CCT ATG TAC GAC AGA TCT GTG GTG CAA 
Phe Leu Pro Leu Ser Ser Leu Pro Met Tyr Asp Arg Ser Val Val Gin 

200 205 210 

GAT CCA TTT CAT ATT CGT CGA AAA ATC TAC AAT GAA AGA AAA CTG CTC 
Asp Pro Phe His He Arg Arg Lys Leu Tyr Asn Glu Arg Lys Leu Leu 
215 220 225 230 

AAA TCT CAG GGC ACA CCT ACT GTT CTA AAT CCA GCA ATT TCT AAA GAT 
Lys Ser Gin Gly Thr Pro Thr Val Leu Asn Pro Ala He Ser Lys Asp 

235 240 245 

GAA GTT CGA GCC ACC TTC ATC CTA CAC CCT ATT GAT ATC ATG AAG CTC 
Glu Val Arg Ala Thr Phe He Leu His Pro He Asp He Met Lys Leu 

250 255 260 

AAG AAA TTC ATT TCG TCA AAA AAT CGC AAC TTA ACC GGT AGT AGT AAT 
Lys Lys Phe He Ser Ser Lys Asn Arg Asn Leu Thr Gly Ser Ser Asn 

.265 270 . . . 275 

TAT AAT CTG TCA ACT TTC ACG GTG ACA TCT GCA CTG ATC TGG ACA TGC 
Tyr Asn Leu Ser Thr Phe Thr Val Thr Ser Ala Leu He Trp Thr Cys 
280 285 290 
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TTG TCG AAA TCA TTA GAC ACC GTC GTA AGA GAG AAG GTG GAA GAG GAT 
Leu Ser Lys Ser Leu Asp Thr Val Val Arg Glu Lys Val Glu Glu Asp 
295 300 305 310 

AAA CAT GCA GCA AAC TTA TGT GCT TTC ATC AAC TGC CGA CAA CGT TTT 
Lys His Ala Ala Asn Leu Cys Ala Phe He Asn Cys Arg Gin Arg Phe 

315 320 325 

GCT CCG CCG ATA CCT CAA AAT TAC TTT GGA AAT TGC ATA GTG CCT TGT 
Ala Pro Pro He Pro Gin Asn Tyr Phe Gly Asn Cys He Val Pro Cys 

330 335 340 

ATG GTG GGA TCG ACT CAT GAG CAA CTT GTA GGA AAT GAA GGG TTG TCG 
Met Val Gly Ser Thr His Glu Gin Leu Val Gly Asn Glu Gly Leu Ser 

345 350 355 

GTA GCT GCA ACC GCC ATC GGA GAT GCT ATC CAT AAG AGG TTA CAT GAC 
Val Ala Ala Thr Ala He Gly Asp Ala He His Lys Arg Leu His Asp 

360 365 370 

TAC GAA GGA ATT CTG AGA GGA GAT TGG ATA TCG CCG CCC CGA TCA ACA 
Tyr Glu Gly He Leu Arg Gly Asp Trp He Ser Pro Pro Arg Ser Thr 
375 380 385 390 

TCT GCG GCA CCA AGG TCG ACG CTC ATT TAT GTC GTT GGA TCC GCA CAA 
Ser Ala Ala Pro Arg Ser Thr Leu He Tyr Val Val Gly Ser Ala Gin 

395 400 405 

CGC AAT GTG CAT GAT TTT GAT GCA GAT TTT GGT TGG GGA AAG CTT GAA 
Arg Asn Val His Asp Phe Asp Ala Asp Phe Gly Trp Gly Lys Leu Glu 

410 415 420 

AAG CAT GAA TCT GTT TCA ACT AAT CCT TCG GCA ACA CTA ATT TTG ATC 
Lys His Glu Ser Val Ser Thr Asp Pro Ser Ala Thr Leu He. Leu He 

425 430 435 

TCT CGG TCC AGA AGA TTT AAA GGA GCA CTT GAG CTT GGC ATT TCT TTG 
Ser Arg Ser Arg Arg Phe Lys Gly Ala Leu Glu Leu Gly He Ser Leu 

440 445 450 

CCT AAG AAT AGG ATG GAC GCA TTT GCC ACC ATT TTT ACG AAT TTC ATC 
Pro Lys Asn Arg Met Asp Ala Phe Ala Thr He Phe Thr Asn Phe He 
455 . 460 465 470 

AAT AGT CTC CAT GTG AGG AGC CCT TTG TAAGAAAAAA GTGGTATCAA 
Asn Ser Leu His Val Arg Ser Pro Leu 
475 479 
TGTATAAAAA AGACAGACAA GTTATGATGC AACAAATGTT TTAGGAGATT ACAAATCCAT 
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GGGAAGATGT ATCAAACTCA TCTCTCTATA TATATATATT CAATTGTTTT AAAAAAAAAA 1611 

AAAAAAAAAA A 1622 

SEQ ID NO: 3 
SEQUENCE LENGTH: 1605 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA to mRNA 
HYPOTHETICAL SEQUENCE: No 
ANTI -SENSE: No 
ORIGINAL SOURCE 

ORGANISM: Petunia hybrida 

TISSUE TYPE: petal 
IMMEDIATE SOURCE 

LIBRARY: cDNA library 

CLONE: pPAT48 
SEQUENCE DESCRIPTION 

TGTCGACGAA ATCCATTTCA TTTCCTCTTC TTTCTTGTTT TTCTAATTTC GTCATCATTG 60 
TTATCC ATG GCA GGT GAA GTA GCA AAA CAA GAA GTT ACA AAA GTG AAA 108 
Met Ala Gly Glu Val Ala Lys Gin Glu Val Thr Lys Val Lys 
1 5 10 

GTC CTG AAA AAA ACA AAC GTG AAA CCA CAT AAA CCA CTA GGA AAA AAA 156 
Val Leu Lys Lys Thr Asn Val Lys Pro His Lys Pro Leu Gly Lys Lys 
15 20 25 30 

GAG TGT CAA TTG GTA ACA TTT GAT CTT CCT TAC CTA GCT TTC TAT TAC 204 
Glu Cys Gin Leu Val Thr Phe Asp Leu Pro Tyr Leu Ala Phe Tyr Tyr 

35 40 45 

AAC CAA AAA TTT CTC ATC TAT AAA GGT GCT GAA AAC TTT GAC GAG ACG 252 
Asn Gin Lys Phe Leu lie Tyr Lys Gly Ala Glu Asn Phe Asp Glu Thr 

50 55 60 

GTG GAA AAA ATT AAA GAT GGA CTG GCC TTA GTA TTG GTG GAT TTC TAT 300 
Val Glu Lys lie Lys Asp Gly Leu Ala Leu Val Leu Val Asp Phe Tyr 

.65 70 . 75 

CAA CTA GCT GGG AAA CTT GGA AAA GAT GAA GAA GGG GTT TTC AGG GTG 3 48 

Gin Leu Ala Gly Lys Leu Gly Lys Asp Glu Glu Gly Val Phe Arg Val 
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GAA TAC 
Glu Tyr 

95 
GAA GAG 
Glu Glu 

TTC CAG 
Phe Gin 

CAT CGC 
His Arg 

ACC ATG 
Thr Met 
160 
TGG CAC 
Trp His 
175 

ATT TCT 
lie Ser 

GTC AAG 
Val Lys 

TCA GCA 
Ser Ala 

GAA AGA 
Glu Arg 
240 
ACA GTC 
Thr Val 
255 

CTC TCC 
Leu Ser 



GAC GAT 
Asp Asp 

ATA GAA 
lie Glu 

GAC TTG 
Asp Leu 
130 
CCT CTT 
Pro Leu 
145 

GGA TTA 
Gly Leu 

TTT ATG 
Phe Met 

GTC CCA 
Val Pro 

CTC AAC 
Leu Asn 
210 
ACC AAC 
Thr Asn 
225 

GTA TTC 
Val Phe 

AAT GCC 
Asn Ala 

GCA CAC 
Ala His 



GAC ATG 
Asp Met 
100 
GTT GCA 
Val Ala 
115 

ATT CCT 
lie Pro 

CTA GCT 
Leu Ala 

GCA TTT 
Ala Phe 

ACC TCG 
Thr Ser 
180 
CCA TTC 
Pro Phe 
195 

CTC TCT 
Leu Ser 

GGT GAT 
Gly Asp 

AAG TTC 
Lys Phe 

AAC TCA 
Asn Ser 
260 
GTG TGG 
Val Trp 
275 



GAT GGT 
Asp Gly 

GAT CTT 
Asp Leu 

TGT AAT 
Cys Asn 

GTG CAG 
Val Gin 
150 
AAC CAT 
Asn His 
165 

TGG TCC 
Trp Ser 

CTT GAA 
Leu Glu 

CAA CCA 
Gin Pro 

GTC CCG 
Val Pro 
230 
TCC GAG 
Ser Glu 
245 

GGA GAG 
Gly Glu 

CTA GCC 
Leu Ala 



GTA GAG 
Val Glu 

ACT GAT 
Thr Asp 
120 
AAA ATC 
Lys He 
135 

CTC ACC 
Leu Thr 

GCT GTG 
Ala Val 

GAG CTT 
Glu Leu 

CGA ACC 
Arg Thr 
200 
TCA GAT 
Ser Asp 
215 

GCC AAC 
Ala Asn 

TTA GCA 
Leu Ala 

ACG CCA 
Thr Pro 

GTC ACA 
Val Thr 
280 



GTG ACA 
Val Thr 
105 

GAA GAA 
Glu Glu 

TTG AAT 
Leu Asn 

AAG CTC 

Lys Leu 

CTG GAT 
Leu Asp 
170 
TGC TGT 
Cys Cys 
185 

AAG GCT 
Lys Ala 

GCA CCC 
Ala Pro 

GTA GAC 
Val Asp 

ATT GAC 
He Asp 
250 
TTC TCC 
Phe Ser 
265 

CGT GCG 
Arg Ala 



GTG GCT 
Val Ala 

GGC ACC 
Gly Thr 

TTG GAA 
Leu Glu 
140 
AAG GAC 
Lys Asp 
155 

GGT ACT 
Gly Thr 

GGG TCC 
Gly Ser 

CGT AAC 
Arg Asn 

GAA CAT 
Glu His 
220 
CCA CCT 
Pro Pro 
235 

AAA ATC 
Lys He 



GTT GCA 
Val Ala 
110 
ACC AAA 
Thr Lys 
125 

GGG CTT 
Gly Leu 

GGG CTC 
Gly Leu 

TCG ACG 
Ser Thr 

ACC TCA 
Thr Ser 
190 
ACT CGA 
Thr Arg 
205 

GCT AAG 
Ala Lys 

CTT CGC 
Leu Arg 

AAG TCA 
Lys Ser 



ACA TTC CAA TCA 
Thr Phe Gin Ser 
270 

CGC CAA CTC AAG 
Arg Gin Leu Lys 
285 
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CCC GAG GAC TAC ACT GTG TAC ACT GTG TTT GCT GAT TGC AGG AAA AGG 972 
Pro Glu Asp Tyr Thr Val Tyr Thr Val Phe Ala Asp Cys Arg Lys Arg 

290 295 300 

GTT GAT CCT CCA ATG CCA GAA AGT TAC TTC GGC AAC CTA ATT CAG GCA 1020 
Val Asp Pro Pro Met Pro Glu Ser Tyr Phe Gly Asn Leu lie Gin Ala 

305 310 315 

ATT TTC ACA GTG ACC GCG GCA GGT TTG TTA CTA GCA AGC CCG ATC GAG 1068 
He Phe Thr Val Thr Ala Ala Gly Leu Leu Leu Ala Ser Pro He Glu 

320 325 330 

TTC GCT GGT GGG ATG ATA CAA CAA GCG ATC GTG AAG CAT GAC GCT AAG 1116 
Phe Ala Gly Gly Met He Gin Gin Ala He Val Lys His Asp Ala Lys 
335 340 345 350 

GCC ATT GAT GAA AGA AAC AAG GAG TGG GAG AGC AAC CCG AAG ATC TTT 1164 
Ala He Asp Glu Arg Asn Lys Glu Trp Glu Ser Asn Pro Lys He Phe 

355 360 365 

CAG TAC AAA GAT GCT GGA GTG AAC TGT GTT GCT GTT GGA AGT TCG CCA 1212 
Gin Tyr Lys Asp Ala Gly Val Asn Cys Val Ala Val Gly Ser Ser Pro 

370 375 380 

AGG TTC AAG GTT TAC GAC GTG GAT TTT GGA TGG GGA AAG CCA GAG AGT 1260 
Arg Phe Lys Val Tyr Asp Val Asp Phe Gly Trp Gly Lys Pro Glu Ser 

385 390 395 

GTG AGG AGT GGT TCG AAC AAT AGG TTT GAT GGA ATG GTG TAT TTG TAC 1308 
Val Arg Ser Gly Ser Asn Asn Arg Phe Asp Gly Met Val Tyr Leu Tyr 

400 405 410 

CAA GGC AAA AAT GGA GGA AGA AGC ATT GAT GTG GAG ATT AGT TTG GAA 1356 
Gin Gly Lys Asn Gly Gly Arg Ser He Asp Val Glu He Ser Leu Glu 
415 420 425 430 

GCA AAT GCT ATG GAG AGG TTG GAG AAA GAT AAA GAG TTC CTC ATG GAA 1404 
Ala Asn Ala Met Glu Arg Leu Glu Lys Asp Lys Glu Phe Leu Met Glu 

435 440 445 

ACT GCT TAATTTGCTT AGCTTGGACT CAACTGGCTA CACTTTATTT ATGAGCTGCT 1460 
Thr Ala 

ATGACTCACA TGCATGTATG TTTATTTTTT TTGGAGGGGT TCTTTCCTTT TATTGTTTTC 1520 
TATGTTTTTT CTTTCTTGTA CGTTATGAAG AGAAACCGAG TATAAAGGAA TAATGTTTTC 1580 
AGTTATTAAA AAAAAAAAAA AAAAA 1605 
SEQ ID NO: 4 
SEQUENCE LENGTH: 1479 
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SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: double 

TOPOLOGY: linear 

MOLECULE TYPE: cDNA to mRNA 

HYPOTHETICAL SEQUENCE: No 

ANTI-SENSE: No 

ORIGINAL SOURCE 

ORGANISM: Perilla ocimoides 

TISSUE TYPE: leaf 
IMMEDIATE SOURCE 

LIBRARY: cDNA library 

CLONE: pSAT208 
SEQUENCE DESCRIPTION 

CC GTG ATC GAA ACG TGT AGA GTT GGG CCG CCG CCG GAC TCG GTG GCG 
Val lie Glu Thr Cys Arg Val Gly Pro Pro Pro Asp Ser Val Ala 
15 10 15 

GAG CAA TCG GTG CCG CTC ACA TTC TTC GAC ATG ACG TGG CTG CAT TTT 
Glu Gin Ser Val Pro Leu Thr Phe Phe Asp Met Thr Trp Leu His Phe 

20 25 30 

CAT CCC ATG CTT CAG CTC CTC TTC TAC GAA TTC CCT TGT TCC AAG CAA 
His Pro Met Leu Gin Leu Leu Phe Tyr Glu Phe Pro Cys Ser Lys Gin 

35 40 45 

CAT TTT TCA GAA TCC ATC GTT CCA AAA CTC AAA CAA TCT CTC TCT AAA 
His Phe Ser Glu Ser lie Val Pro Lys Leu Lys Gin Ser Leu Ser Lys 

50 55 60 

ACT CTC ATA CAC TTC TTC CCT CTC TCA TGC AAT TTA ATC TAC CCT TCA 
Thr Leu lie His Phe Phe Pro Leu Ser Cys Asn Leu lie Tyr Pro Ser 

65 70 75 

TCC CCG GAG AAA ATG CCG GAG TTT CGG TAT CTA TCC GGG GAC TCG GTT 
. Ser Pro Glu Lys Met Pro Glu Phe Arg Tyr Leu Ser Gly Asp Ser Val 
80 85 90 95 

TCT TTC ACC ATC GCA GAA TCT AGC GAC GAC TTC GAT GAT CTC GTC GGA 
Ser Phe .Thr lie Ala Glu Ser Ser Asp Asp Phe Asp Asp Leu Val Gly 

100 105 110 

AAT CGT CCA GAA TCT CCC GTT AGG CTC TAC AAC TTT GTC CCT AAA TTG 
Asn Arg Pro Glu Ser Pro Val Arg Leu Tyr Asn Phe Val Pro Lys Leu 
115 120 125 
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CCG CCC ATT GTC GAA GAA TCC GAT AGA AAA CTC TTC CAA GTT TTC GCC 
Pro Pro lie Val Glu Glu Ser Asp Arg Lys Leu Phe Gin Val Phe Ala 

130 135 140 

GTG CAG GTG ACT CTT TTC CCA GGC CGA GGC GTC GGT ATT GGA ATA GCA 
Val Gin Val Thr Leu Phe Pro Gly Arg Gly Val Gly He Gly He Ala 

145 150 155 

ACG CAT CAC ACC GTT AGC GAC GCC CCG TCG TTT CTC GCG TTT ATA ACG 
Thr His His Thr Val Ser Asp Ala Pro Ser Phe Leu Ala Phe He Thr 
160 165 170 175 

GCT TGG TCT TCA ATG AGC AAA CAC ATT GAA AAT GAA GAT GAA GAT GAA 
Ala Trp Ser Ser Met Ser Lys His He Glu Asn Glu Asp Glu Asp Glu 

180 185 190 

GAA TTT AAA TCT TTG CCA GTT TTC GAT AGA. TCC GTC ATA AAA TAT CCG 
Glu Phe Lys Ser Leu Pro Val Phe Asp Arg Ser Val He Lys Tyr Pro 

195 200 205 

ACG AAA TTT GAC TCC ATT TAT TGG AGA AAC GCG CTA AAA TTT CCT TTG 
Thr Lys Phe Asp Ser He Tyr Trp Arg Asn Ala Leu Lys Phe Pro Leu 

210 215 220 

CAA TCT CGT CAT CCC TCA TTA CCG ACG GAC CGC ATT CGA ACC ACG TTC 
Gin Ser Arg His Pro Ser Leu Pro Thr Asp Arg He Arg Thr Thr Phe 

225 230 235 

GTT TTC ACC CAA TCC AAA ATT AAG AAA TTG AAG GGT TGG ATT CAG TCC 
Val Phe Thr Gin Ser Lys He Lys Lys Leu Lys Gly Trp lie Gin Ser 
240 245 250 255 

AGA GTT CCA AGT TTA GTC CAT CTC TCA TCT TTT GTA GCG ATT GCA GCT 
Arg Val Pro Ser Leu Val His Leu Ser Ser Phe Val Ala He Ala Ala 

260 265 270 

TAT ATG TGG GCT GGC ATA ACG AAA TCA TTC ACA GCA GAT GAA GAC CAA 
Tyr Met Trp Ala Gly lie Thr Lys Ser Phe Thr Ala Asp Glu Asp Gin 

275 280 285 

GAC AAC GAG GAT GCA TTT TTC TTG ATT CCG GTC GAT CTA AGG CCA CGA 
Asp Asn Glu Asp Ala Phe Phe Leu He Pro Val Asp Leu Arg Pro Arg 

.290 295 300 

TTA GAT CCG CCG GTT CCT GAA AAT TAC TTC GGG AAC TGC TTA TCG TAC 
Leu Asp Pro Pro Val Pro Glu Asn Tyr Phe Gly Asn Cys Leu Ser Tyr 
305 310 315 
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GCG CTG CCG AGA ATG CGG CGG CGA GAG CTG GTG GGA GAG AAA GGG GTG 1007 

Ala Leu Pro Arg Met Arg Arg Arg Glu Leu Val Gly Glu Lys Gly Val 

320 325 330 335 

TTT CTG GCA GCT GAG GTA ATC GCG GCG GAG ATA AAA AAA AGG ATC AAC 1055 

Phe Leu Ala Ala Glu Val He Ala Ala Glu He Lys Lys Arg He Asn 

340 345 350 

GAC AAG AGA ATA TTA GAA ACG GTG GAG AAA TGG TCG CCG GAG ATT CGT 1103 
Asp Lys Arg He Leu Glu Thr Val Glu Lys Trp Ser Pro Glu He Arg 

355 360 365 

AAA GCG TTG CAG AAA TCA TAT TTT TCG GTG GCA GGA TCG AGC AAG CTA 1151 
Lys Ala Leu Gin Lys Ser Tyr Phe Ser Val Ala Gly Ser Ser Lys Leu 

370 375 380 

GAT CTT TAC GGT GCA GAT TTT GGA TGG GGG AAG GCG AGA AAG CAA GAA 1199 
Asp Leu Tyr Gly Ala Asp Phe Gly Trp Gly Lys Ala Arg Lys Gin Glu 

385 390 395 

ATA TTG TCG ATT GAT GGG GAG AAA TAT GCA ATG ACR CTT TGT AAA GCC 1247 
He Leu Ser He Asp Gly Glu Lys Tyr Ala Met Thr Leu Cys Lys Ala 
400 405 410 415 

AGG GAT TTC GAA GGA GGA TTG GAG GTT TGC TTG TCT TTG CCT AAG GAC 1295 
Arg Asp Phe Glu Gly Gly Leu Glu Val Cys Leu Ser Leu Pro Lys Asp 

420 425 430 

AAA ATG GAT GCT TTT GCT GCT TAT TTT TCA CTG GGA ATT AAT GGT 1340 
Lys Met Asp Ala Phe Ala Ala Tyr Phe Ser Leu Gly He Asn Gly 
435 440 446 

TAATAAATGT ATGTAATTAA ACTAATATTA TTATGTAACA ATTAATTAAG TGTTGAGTAA 1400 
CGTGAAGAAT AATCCCTATT ATATATTTAT GATTTGGTTC AAATAAAGTG TAAAGCCTCT 1460 
TGAAAAAAAA AAAAAAAAA 1479 
SEQ ID NO: 5 
SEQUENCE LENGTH: 1508 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA to mRNA 
HYPOTHETICAL SEQUENCE: No 
ANTI-SENSE: No 
ORIGINAL SOURCE 

ORGANISM: Senecio cruentus 
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TISSUE TYPE: petal 
IMMEDIATE SOURCE 

LIBRARY : cDNA library 

CLONE : pCAT8 
SEQUENCE DESCRIPTION 

TG AAC ATT CTC GAA CAT GCC CGA ATA TCG GCC CCC TCG GGC ACC ATC 
Asn He Leu Glu His Ala Arg He Ser Ala Pro Ser Gly Thr He 
15 10 15 

GGC CAT CGC TCG TTA TCT CTT ACT TTC TTC GAC ATT ACT TGG CTA CTC 
Gly His Arg Ser Leu Ser Leu Thr Phe Phe Asp He Thr Trp Leu Leu 

20 25 30 

TTC CCT CCG GTC CAC CAT CTT TTC TTC TAT GAC TTT CCA CAT TCT AAA 
Phe Pro Pro Val His His Leu Phe Phe Tyr Asp Phe Pro His Ser Lys 

35 40 45 

TCC CAT TTC ATG GAC ACT ATT GTT CCC AGG CTA AAA CAA TCT TTA TCG 
Ser His Phe Met Asp Thr He Val Pro Arg Leu Lys Gin Ser Leu Ser 

50 55 60 

GTC ACT CTT CAA CAT TTT TTC CCG TTT GCT AGT AAT TTG ATT GTA TTT 
Val Thr Leu Gin His Phe Phe Pro Phe Ala Ser Asn Leu He Val Phe 

65 70 75 

CCT AAC ACT GAT GGT TCG GGT TTT AAT AAA AAA CCA GAA ATA AAA CAC 
Pro Asn Thr Asp Gly Ser Gly Phe Asn Lys Lys Pro Glu He Lys His 
80 85 90 95 

GTT GAA GGT GAT TCT GTT GTG GTT ACT TTT GCA GAA TGT TGT CTT GAC 
Val Glu Gly Asp Ser Val Val Val Thr Phe Ala Glu Cys Cys Leu Asp 

100 105 110 

TTT AAT AAT TTG ACA GGA AAT CAT CCT CGA AAA TGT GAA AAC TTT TAT 
Phe Asn Asn Leu Thr Gly Asn His Pro Arg Lys Cys Glu Asn Phe Tyr 

115 120 125 

CCA CTT GTA CCT TCA TTG GGA AAT GCA ATC AAA TTA TGT GAT TGC GTC 
Pro Leu Val Pro Ser Leu Gly Asn Ala He Lys Leu Cys Asp Cys Val 

130 135 140 

ACG GTC CCA CTT TTT TCA CTT CAA GTG ACG TTT TTT CCG GGC TCG GGT 
Thr Val Pro Leu Phe Ser Leu Gin Val Thr Phe Phe Pro Gly Ser Gly 
145 150 155 
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ATA TCA CTA GGA ATG ACG AAT CAT CAT AGC CTT GGT GAC GCT AGC ACG 
lie Ser Leu Gly Met Thr Asn His His Ser Leu Gly Asp Ala Ser Thr 
160 165 170 175 

CGG TTC AAC TTT TTG AAA GGG TGG ACT TCG ATT ATT CAA TCT GGT GTA 
Arg Phe Asn Phe Leu Lys Gly Trp Thr Ser He He Gin Ser Gly Val 

180 185 190 

GAT CGG TCT TTT TTA ACG AAA GGA TCT CCA CCG GTT TTT GAT AGA TTG 
Asp Arg Ser Phe Leu Thr Lys Gly Ser Pro Pro Val Phe Asp Arg Leu 

195 200 205 

ATT AAC ATC CCA CAT TTA GAT GAA AAT AAG TTG AGA CAT ACA AGG CTC 
He Asn He Pro His Leu Asp Glu Asn Lys Leu Arg His Thr Arg Leu 

210 215 220 

GAA AGT TTT TAT AAA CCT TCG AGC CTT GTT GGT CCC ACT GAT AAA GTT 
Glu Ser Phe Tyr Lys Pro Ser Ser Leu Val Gly Pro Thr Asp Lys Val 

225 230 235 

CGG TCA ACG TTT GTG TTG ACC CGA ACT AAT ATC AAT CTA CTA AAG AAA 
Arg Ser Thr Phe Val Leu Thr Arg Thr Asn He Asn Leu Leu Lys Lys 
240 245 250 255 

AAG GTC TTA ACC CAA GTG CCA AAC TTG GAG TAC ATG TCA TCT TTT ACG 
Lys Val Leu Thr Gin Val Pro Asn Leu Glu Tyr Met Ser Ser Phe Thr 

260 265 270 

GTA ACT TGT GGT TAT ATA TGG AGT TGC ATA GCG AAA TCA CTC GTA AAA 
Val Thr Cys Gly Tyr He Trp Ser Cys He Ala Lys Ser Leu Val Lys 

275 280 285 

ATA GGA GAA AGA AAG GGC GAA GAC GAG TTA GAA CAG TTC ATA ATC ACC. 
He Gly Glu Arg Lys Gly Glu Asp Glu Leu Glu Gin Phe He He Thr 

290 295 300 

ATT GAT TGT CGA TCT CGT CTT GAT CCA CCA ATT CCC ACA GCC TAC TTT 
He Asp Cys Arg Ser Arg Leu Asp Pro Pro He Pro Thr Ala Tyr Phe 

305 310 315 

GGT AAC TGT GGT GCA CCA TGT GTC CCG ACC TTA AAA AAT GTC GTT TTG 
Gly Asn Cys Gly Ala Pro Cys Val Pro Thr Leu Lys Asn Val Val Leu 
320 ... 325 . 330 335 

ACT AGC GAA AAT GGG TAT GCA CTT GGT GCT AAA GTA ATT GGA GAG TCT 
Thr Ser Glu Asn Gly Tyr Ala Leu Gly Ala Lys Val He Gly Glu Ser 
340 345 350 
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ATA TGC AAA ATG ATA TAT AAT AAG GAC GGA ATC TTG AAA GAT GCC GCG 1103 
lie Cys Lys Met lie Tyr Asn Lys Asp Gly He Leu Lys Asp Ala Ala 

355 360 365 

AGA TGG CAT GAA CCT TTC ATG ATC CCG GCT AGG AAG ATT GGT GTT GCT 1151 
Arg Trp His Glu Pro Phe Met He Pro Ala Arg Lys He Gly Val Ala 

370 375 380 

GGT ACA CCT AAG CTC AAC TTG TAC GAC TTT GAT TTT GGG TGG GGG AAG 1199 
Gly Thr Pro Lys Leu Asn Leu Tyr Asp Phe Asp Phe Gly Trp Gly Lys 

385 390 395 

CCG ATA AAG TAT GAG ACT GTT TCA ATA GAC TAT AAT ACG TCG ATT TCT 1247 
Pro He Lys Tyr Glu Thr Val Ser He Asp Tyr Asn Thr Ser He Ser 
400 405 410 415 

ATA AAT GCA AGC AAA ACA TCA GCA CAA GAT CTT GAA ATT GGA TTG AGT 1295 
He Asn Ala Ser Lys Thr Ser Ala Gin Asp Leu Glu He Gly Leu . Ser 

420 425 430 

CTA CCG AGT ATG CAA ATG GAG GCG TTT TCT AGC ATC TTT GAT GAA GGA 1343 
Leu Pro Ser Met Gin Met Glu Ala Phe Ser Ser He Phe Asp Glu Gly 

435 440 445 

TTA GAG AGT CAA GTT TCA TTG TAGATCATCG TCCCCTTTTT GTGTGCATCA 1394 
Leu Glu Ser Gin Val Ser Leu 
450 454 
AGTTTCTGTC GTTTTTATGA GTTGCCACTG TTCTATTCTT TAAGTATACC TTTCGACTAT 1454 
GTTTTGAAGA TGCAACGATA TAAAATGAAA AAAAAAAAAA AAAAAAAAAA AAAA 1508 
SEQ ID NO: 6 
SEQUENCE LENGTH: 1521 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA to mRNA 
HYPOTHETICAL SEQUENCE: No 
ANTI-SENSE: No 
ORIGINAL SOURCE 

ORGANISM: Lavandula angustifolia 

TISSUE TYPE: petal 
IMMEDIATE SOURCE 

LIBRARY: cDNA library 

CLONE: pLAT21 
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SEQUENCE DESCRIPTION 

-TG ACC ACC CTC CTC GAA TCC TCC CGA GTG GCG CCG CCT CCA GGC ACG 
Xxx Thr Thr Leu Leu Glu Ser Ser Arg Val Ala Pro Pro Pro Gly Thr 
15 10 15 

GTG GCT GAG CAG TCA CTC CCG CTC ACC TTC TTC GAC ATG ACG TGG CTG 
Val Ala Glu Gin Ser Leu Pro Leu Thr Phe Phe Asp Met Thr Trp Leu 

20 25 30 

CAT TTC CAC CCC ATG CTT CAG CTT CTC TTC TAC GAA CTC CCC TGT TCC 
His Phe His Pro Met Leu Gin Leu Leu Phe Tyr Glu Leu Pro Cys Ser 

35 40 45 

AAA CCC GCC TTC CTC GAA ACC GTC GTT CCG AAA CTC AAA CAA TCC TTA 
Lys Pro Ala Phe Leu Glu Thr Val Val Pro Lys Leu Lys Gin Ser Leu 

50 55 60 

TCT CTA ACC CTC AAA CAC TTC TTC CCC CTT TCA TGC AAT CTA ATC TAC 
Ser Leu Thr Leu Lys His Phe Phe Pro Leu Ser Cys Asn Leu lie Tyr 

65 70 75 

CCT CTA TCG CCG GAG AAA ATG CCG GAG TTC CGG TAT CAG AAC GGT GAC 
Pro Leu Ser Pro Glu Lys Met Pro Glu Phe Ser Val Ser Phe Thr He 
80 65 90 95 

TCG GTT TCT TTC ACG ATT ATG GAG TCT GTC GGA GAT CAT CCG CAT TCC 
Met Glu Ser Ser Asp Asp Tyr Glu Asp Val Gly Asp His Pro His Ser 

100 105 110 

GCT CAT AAA TAC TAC TGC TTT GCC CCT AGC GAC GAT TAT GAA GAT CTC 
Ala His Lys Tyr Tyr Cys Phe Ala Gin Leu Pro Pro He Val Glu Glu 

115 120 125 

CAG CTG CCG CCG ATA GTC GAG GAA TCT GAT CGG AAA TTG TTT CAA GTT 
Ser Asp Arg Lys Leu Phe Gin Val Pro Leu Arg Tyr Gin Asn Gly Asp 

130 135 140 

TTA GCC GTG CAA GTG ACT CTG TTT CCC GGT CGC GGG GTG TGC ATC GGA 
Leu Ala Val Gin Val Thr Leu Phe Pro Gly Arg Gly Val Cys He Gly 

145 ISO 155 

ATA ACG ACG CAC CAC ACC GTT AGC GAT GCT CCA TCG TTT GTA GGG TTT 
He Thr .Thr His His Thr Val Ser Asp Ala Pro Ser Phe Val Gly Phe 
160 165 170 175 

ATG AAG AGT TGG GCT TCC ATC ACT AAA TTC GGA GGA GAT GAT GAA TTC 
Met Lys Ser Trp Ala Ser He Thr Lys Phe Gly Gly Asp Asp Glu Phe 
180 185 190 
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TTG GAC GGA AAA GGT GAA TGT TTG CCG GTT TTC GAC CGA TCG CTC GTG 
Leu Asp Gly Lys Gly Glu Cys Leu Pro Val Phe Asp Arg Ser Leu Val 

195 200 205 

AAT TAT CCG CCT AAA TTG GAC ACA TAT TTA TGG AAC AAC GCG CAG AAA 
Asn Tyr Pro Pro Lys Leu Asp Thr Tyr Leu Trp Asn Asn Ala Gin Lys 

210 215 220 

CGT CCG TTG GAA TCG CAG CAT CCA TCT TTA CCG ACG GAT CGG ATT CGA 
Arg Pro Leu Glu Ser Gin His Pro Ser Leu Pro Thr Asp Arg He Arg 

225 230 235 

GCT ACC TAC CTT TTC ACC CAA TCT GAA ATT AAG AAA TTG AAG GGT TTG 
Ala Thr Tyr Leu Phe Thr Gin Ser Glu He Lys Lys Leu Lys Gly Leu 
240 245 250 255 

ATT CAG AGA AAA GCC CCA AAT GTA GTT AAT CTC TCT TCC TTC GTC GCG 
He Gin Arg Lys Ala Pro Asn Val Val Asn Leu Ser Ser Phe Val Ala 

260 265 270 

ATC GCA GCT TAT ATC TGG ACC GGC ATC GCC AAA TCG GTC GGA GAT TAC 
He Ala Ala Tyr He Trp Thr Gly He Ala Lys Ser Val Gly Asp Tyr 

275 280 285 

AAA GAC GTG GAT GAC GAC AAA CGC GCT TTC TTT TTA ATT CCG ATC GAT 
Lys Asp Val Asp Asp Asp Lys Arg Ala Phe Phe Leu He Pro He Asp 

290 295 300 

TTA AGG CCG CGT TTG GAT CCG CCG GCT CCG GGG AAC TAC TTC GGA AAC 
Leu Arg Pro Arg Leu Asp Pro Pro Ala Pro Gly Asn Tyr Phe Gly Asn 

305 310 315 

TGT CTA TCG TTT GCG ATG GCG AAG ATC CTG CGG CGG GAT TTG GTC GGA 
Cys Leu Ser Phe Ala Met Ala Lys He Leu Arg Arg Asp Leu Val Gly 
320 325 330 335 

GAT GAA GGG GTG TTT CGG GCA GCT GAG GCG ATC GCG GCG GAA ATA GAG 
Asp Glu Gly Val Phe Arg Ala Ala Glu Ala He Ala Ala Glu He Glu 

3.40 345 350 

AAG AGG ACG AGC GAC AAG AAG ATT CTA GAA ACT GTG GAG AAC TGG CCG 
Lys Arg Thr Ser Asp Lys Lys He Leu Glu Thr Val Glu Asn Trp Pro 

355 .360 - 365 

TCT GAG ATT. CGC GAA GCC TTG CAA AAC TGT TAT TTC TCG GTG GCG GGA 
Ser Glu He Arg Glu Ala Leu Gin Asn Cys Tyr Phe Ser Val Ala Gly 
370 375 380 
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TCG AGC AGG CTT GAT CTT TAC GGC GCG GAT TTT GGA TGG GGT AAG GCG 1199 
Ser Ser Arg Leu Asp Leu Tyr Gly Ala Asp Phe Gly Trp Gly Lys Ala 

385 390 395 

GTG AAG CAA GAG ATA CTG TCG ATT GAT GGA GAG AAG TTT ACG ATG TCG 124 7 
Val Lys Gin Glu He Leu Ser He Asp Gly Glu Lys Phe Thr Met Ser 
400 405 410 415 

TTG TGT AAA CCG AGG GAT GCT GCC GGA GGA TTG GAG GTT GGA TTG TCT 1295 
Leu Cys Lys Pro Arg Asp Ala Ala Gly Gly Leu Glu Val Gly Leu Ser 

420 425 430 

TTG CCA AAG GAG GAA TTG CAA GCT TTT GAT GAT TAT TTT GCG GAG GGA 1343 
Leu Pro Lys Glu Glu Leu Gin Ala Phe Asp Asp Tyr Phe Ala Glu Gly 

435 440 445 

ATA AAG GGT TGATTAATCA TTTAATCATG TATTATGAAG TTGGATGAAA 13 92 

He Lys Gly 
450 

TCCTCTGTTT CATCTCTATT GTTTAAACAA TAATTTTTTT CCATTGAACT TTTTTGAGTC 1452 

AATAAAAAAA AAAAAAAAAA AAAAAAAATG AAAAAACTCA GTTATTTTTT TTTTTTTTTT 1512 

TTTTTTTTT 1521 

SEQ ID NO: 7 

SEQUENCE LENGTH: 10 

SEQUENCE TYPE: amino acid 

TOPOLOGY: linear 

MOLECULE TYPE: peptide 

HYPOTHETICAL SEQUENCE: No 

SEQUENCE DESCRIPTION 

Arg Phe Leu Gly lie Thr Gly Ser Pro Lys 

15 10 
SEQ ID NO: 8 
SEQUENCE LENGTH: 8 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
HYPOTHETICAL SEQUENCE: No 
SEQUENCE DESCRIPTION 
He His Met Asp Ala Phe Ala Lys 

1 5 
SEQ ID NO: 9 
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SEQUENCE LENGTH: 10 

SEQUENCE TYPE: amino acid 
5 TOPOLOGY: linear 

MOLECULE TYPE: peptide 

HYPOTHETICAL SEQUENCE: No 

SEQUENCE DESCRIPTION 
10 Gly Val Glu lie Gly Val Ser Leu Pro Lys 

15 10 

SEQ ID NO: 10 
15 SEQUENCE LENGTH: 8 

SEQUENCE TYPE: amino acid 

TOPOLOGY: linear 

MOLECULE TYPE: peptide 
20 HYPOTHETICAL SEQUENCE: No 

SEQUENCE DESCRIPTION 

Ala Ser Leu Ser Leu Thr Leu Lys 
1 5 

SEQ ID NO: 11 

SEQUENCE LENGTH: 14 

SEQUENCE TYPE: amino acid 
30 TOPOLOGY: linear 

MOLECULE TYPE: peptide 

HYPOTHETICAL SEQUENCE: No 

SEQUENCE DESCRIPTION 
35 His Tyr Val Pro Leu Ser Gly Asn Leu Leu Met Pro lie Lys 

1 5 10 

SEQ ID NO: 12 

SEQUENCE LENGTH: 14 

40 

SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
45 HYPOTHETICAL SEQUENCE: No 

SEQUENCE DESCRIPTION 

Val Arg Ala Thr Tyr Val Leu Ser Leu Ala Glu He Gin Lys 
15 10 
50 SEQ ID NO: 13 

SEQUENCE LENGTH: 8 
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SEQUENCE TYPE: amino acid 

TOPOLOGY: linear 
5 MOLECULE TYPE : peptide 

HYPOTHETICAL SEQUENCE: No 

SEQUENCE DESCRIPTION 

lie His Met Asp Ala Phe Ala Lys 
1 5 

SEQ ID NO: 14 

SEQUENCE LENGTH: 9 
1S SEQUENCE TYPE: amino acid 

TOPOLOGY: linear 

MOLECULE TYPE: peptide 

HYPOTHETICAL SEQUENCE: No 
20 SEQUENCE DESCRIPTION 

Lys He His Met Asp Ala Phe Ala Lys 
1 5 

SEQ ID NO: 15 
25 SEQUENCE LENGTH: 8 

SEQUENCE TYPE: amino acid 

TOPOLOGY: linear 
30 MOLECULE TYPE: peptide 

HYPOTHETICAL SEQUENCE: No 

SEQUENCE DESCRIPTION 

Lys He His Met Asp Ala Phe Ala 

35 1 5 

SEQ ID NO: 16 

SEQUENCE LENGTH: 23 

SEQUENCE TYPE: nucleic acid 
40 STRANDEDNESS : single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 
4S HYPOTHETICAL SEQUENCE: No 

SEQUENCE DESCRIPTION 

AARATHCAYA TGGAYGCITT YGC 2 3 

SEQ ID NO: 17 
so SEQUENCE LENGTH: 23 

SEQUENCE TYPE: nucleic acid 
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STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 
HYPOTHETICAL SEQUENCE: No 
SEQUENCE DESCRIPTION 
CTCGAGTTTT TTTTTTTTTT TTT 



23 



SEQ ID NO: 18 
SEQUENCE LENGTH: 26 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 
HYPOTHETICAL SEQUENCE: No 
SEQUENCE DESCRIPTION 

TTCACCATGG AGCAAATCCA AATGGT 26 

SEQ ID NO: 19 

SEQUENCE LENGTH: 17 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

HYPOTHETICAL SEQUENCE: No 

SEQUENCE DESCRIPTION 

CGAGTCGCCC TCATCAC 17 

SEQ ID NO: 20 

SEQUENCE LENGTH: 16 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPEr synthetic DNA 

HYPOTHETICAL SEQUENCE: No 
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SEQUENCE DESCRIPTION 
AACAGCTATG ACCATG 
SEQ ID NO: 21 
SEQUENCE LENGTH: 6 
SEQUENCE TYPE: amino acid 
TOPOLOGY: linear 



16 
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MOLECULE TYPE: peptide 
HYPOTHETICAL SEQUENCE: No 
SEQUENCE DESCRIPTION 
Asp Phe Gly Trp Gly Lys 



so 



10 



15 



1 5 
SEQ ID NO: 22 
SEQUENCE LENGTH: 17 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 
HYPOTHETICAL SEQUENCE: No 
SEQUENCE DESCRIPTION 
GAYTTYGGIT GGGGIAA 
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SEQ ID NO: 23 

SEQUENCE LENGTH: 21 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 



SEQ ID NO: 24 
SEQUENCE LENGTH: 23 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 
HYPOTHETICAL SEQUENCE: No 
SEQUENCE DESCRIPTION 

CCATGTCAGG TGTGAGGTTC AAC 23 
SEQ ID-NO: 25 - 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 



HYPOTHETICAL SEQUENCE: No 
SEQUENCE DESCRIPTION 
TGGCAACTGT CTTGCGTCAT G 



21 



so 



STRANDEDNESS: single 
TOPOLOGY: linear 
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MOLECULE TYPE: synthetic DNA 
HYPOTHETICAL SEQUENCE: No 
SEQUENCE DESCRIPTION 
ATCGTTTCGC ATGATTGAAC 
SEQ ID NO: 26 
SEQUENCE LENGTH : 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 
HYPOTHETICAL SEQUENCE: No 
SEQUENCE DESCRIPTION 
TCAGAAGAAC TCGTCAAGAA 
SEQ ID NO: 27 
SEQUENCE LENGTH: 53 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 
HYPOTHETICAL SEQUENCE: No 
SEQUENCE DESCRIPTION 

GGGATCCAAC A ATG GAG CAA ATC CAA ATG GTG GCC GTG ATC GAA ACG TGT 
Met Glu Gin He Gin Met Val Ala Val He Glu Thr Cys 
15 10 

AGA 

Arg . . . 

15 

SEQ ID NO: 28 
SEQUENCE LENGTH: 16 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA - 
HYPOTHETICAL SEQUENCE: No 
SEQUENCE DESCRIPTION 
GTAAAACGAC GGCCAT 
SEQ ID NO: 29 
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SEQUENCE LENGTH: 45 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 
HYPOTHETICAL SEQUENCE: No 
SEQUENCE DESCRIPTION 

GGGATCCAAC A ATG GAG CAA ATC CAA ATG GTG AAC ATT CTC GAA C 45 
Met Glu Gin lie Gin Met Val Asn He Leu Glu 
1 5 10 

SEQ ID NO: 30 
SEQUENCE LENGTH: 21 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: synthetic DNA 
HYPOTHETICAL SEQUENCE: No 
SEQUENCE DESCRIPTION 

CTCGGAGGAA TTCGGCACGA C 2 

SEQ ID NO: 31 

SEQUENCE LENGTH: 35 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: double 

TOPOLOGY: linear 

MOLECULE TYPE: synthetic DNA 

HYPOTHETICAL SEQUENCE: No 

SEQUENCE DESCRIPTION 

AGTCGGATCC AACA ATG ACC ACC CTC CTC GAA TCC 3 
Thr Thr Leu Leu Glu Ser 
1 5 



Claims 

1 . Gene encoding a protein having aromatic acyl group transfer activity or a derivative thereof having said enzymatic 
activity. 

2. The gene according to claim 1 obtainable by cloning using as the primer a nucleotide sequence encoding the 
amino acid sequence as set forth in SEQ ID No. 21 . 

3. The gene according to claim 2 wherein said primer has the nucleotide sequence as set forth in SEQ ID No. 22. 
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4. The gene according to claim 1 or 2 encoding any of the amino acid sequences as set forth in SEQ ID No. 1 to 6, or 
a modified amino acid sequence in which the above amino acid sequence has been modified by addition or 
removal of one or more amino acids, or substitution with other amino acid(s). 

s 5. The gene according to claim 1 or 2 encoding a protein, which gene is capable of hybridizing with part or all of the 
nucleotide sequence encoding any of the amino acid sequences as set forth in SEQ ID No. 1 to 6 under the condi- 
tion of 5 x SSC and 50°C. and which protein has aromatic acyl group transfer activity. 

6. The gene according to claim 1 or 2 encoding a protein, which gene is capable of hybridizing with part or all of the 
10 nucleotide sequence encoding any of the amino acid sequences as set forth in SEQ ID No. 1 to 6 under the condi- 
tion of 2 x SSC and 50°C and which protein has aromatic acyl group transfer activity. 

7. The gene according to claim 1 or 2 encoding a protein which has an amino acid sequence having a homology of at 
least 1 5% or higher with any of the amino acid sequences as set forth in SEQ ID No. 1 to 6, and which has aromatic 

15 acyl group transfer activity. 

8. The gene according to claim 1 or 2 encoding a protein which has an amino acid sequence having a homology of at 
least 30% or higher with any of the amino acid sequences as set forth in SEQ ID No. 1 to 6. and which has aromatic 
acyl group transfer activity. 

so 

9. A vector comprising a gene according to any of claims 1 to 8. 

1 0. A host transformed with a vector according to claim 9. 

25 1 1 . A host according to claim 10 wherein said host is a microbial or animal cell. 

12. A host according to claim 10 wherein said host is a plant cell or a plant body. 

1 3. Protein encoded by a gene according to any of claims 1 to 8. 

30 

14. Protein having aromatic acyl group transfer activity which is obtainable by affinity chromatography of a crude 
enzyme extract of a plant with a resin on which Cibacron Blue 3GA is immobilized. 

1 5. Protein capable of specifically biding to anttoody to the protein of claim 13 or 1 4 and which has aromatic acyl group 
35 transfer activity. 

16. A method for producing a protein having aromatic acyl group transfer activity wherein a host according to claim 10 
is cultured or grown, and then a protein having aromatic acyl group transfer activity is recovered from said host 

40 1 7. A method for producing a protein having aromatic acyl group transfer activity wherein a crude enzyme extract of a 
plant is treated with an affinity chromatography using a resin on which Cibacron Blue 3GA is immobilized. 

18. A method for producing a protein having aromatic acyl group transfer activity, comprising the step of specifically 
binding to antibody to the protein of any of claims 13 to 15. 

45 

19. A method for acylating a pigment, comprising bringing said pigment to the action of the protein of any of claims 13 
to 15. 

20. A method for acylating a pigment in a plant, comprising the steps of introducing a gene according to any one of 
so claims 1 to 8 into the plant, allowing said gene to express, and acylating the pigment in the plant with the protein 

produced. 

21 . A method for stabilizing a pigment wherein said pigment is acylated by the action of the protein according to any of 
claims 13 to 15. 

55 

22. A method for stabilizing a pigment in a plant, comprising the steps of introducing the gene according to any of 
claims 1 to 8 into a plant, allowing said gene to express, and acylating the pigment in the plant with the protein pro- 
duced. 
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23. A method for controlling the color of flowers, comprising the steps of introducing the gene according to any of 
claims 1 to 8 into a plant, allowing said gene to express, and acylabng the pigment in the plant with the protein pro- 
duced. 

5 24. The method according to any of claims 19 to 23 wherein the pigment is anthocyanin. 

25. A plant whose color has been controlled by introducing thereinto a gene according to any of claims 1 to 8, or its 
progeny having the same property, or tissues thereof. 

10 26. The plant tissue according to claim 25 wherein said tissue is a flower. 

27. A cut flower of the plant according to claim 25 or its progeny having the same property. 



45 
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